EDITORIAL


No democracy, no academia 


Over the past 30 weeks, Israel has been undergoing
an upheaval marked by unprecedented attacks by 
the government on the independence of its judi- 
ciary, attorney general, government legal advisers, 
police, military, public broadcasting, and religious 
freedom. This assault on democratic institutions 
and principles is an imminent threat to Israeli 
academia, which relies on a solid democratic foundation. 
In response, universities, academics, and students have 
emerged as key proponents of ongoing protests under the 
banner, “No democracy, no academia.” 

The relationship between academia and democracy 
is bidirectional. Most highly ranked academic institu- 
tions predominantly operate within liberal democracies. 
Countries experiencing democratic 
backsliding, such as Turkey, Poland, 
and Hungary, witness a decline in 
academic achievements. In parallel, a 
robust democracy depends on a free 
academia that introduces new, liberal, 
free, and critical thinking into society. 
Erosion of this two-way association is 
at the heart of concerns in Israel. 

Academia can maintain its excel- 
lence only if it operates without undue 
interference by nonexperts. Academic 
studies are dynamic and are cross- 
fertilized through interactions within 
the international community. Con- 
trol or limitation of this dynamic by 
government and politics is bound to 
reduce excellence. In particular, because of Israel’s geo- 
graphic and geopolitical isolation, its scientists must 
continue full integration within the global scientific com- 
munity if they are to maintain the excellence they pursue. 

Earlier this month, the presidents of most Israeli uni- 
versities sent an open letter to Israeli government leaders 
stating their concerns about destructive processes that 
will hamper Israel’s scientific strength, such as reduced 
international and philanthropic funding to support 
Israeli research institutions and collaborations with other 
nations. The letter also stated that many Israeli scientists’ 
perception of danger to academic freedom is stoking their 
departure from Israel for a better future elsewhere. 

Recent moves by the government to stifle academic 
freedom include attempts in February by the Minister of 
Education to influence appointments of the governing 
body and rector of the National Library. Public resistance
halted these blatant moves to stamp out the autonomy
of an academic institution. There have also been
government attempts to influence the nomination of the
Council for Higher Education’s vice chair without proper 
consultation with academic institutions. 

At this point, over 150 proposed bills that will alter the 
democratic nature of the country are in different stages 
of legislation. At least eight will directly limit academic 
freedom, freedom of speech in academic institutions, and 
minority rights, enforced through sanctions to limit fund- 
ing. These proposed laws, if passed, threaten the reputa- 
tion and status of Israeli scholarship on the global stage. 
This would degrade the capacity of Israeli institutions to 
participate in the global community and to recruit lead- 
ing scientists, and may trigger an exodus of the best and 
brightest scientists. Outstanding researchers depend on 
costly state-of-the-art facilities that are funded by com- 
petitive international grants. If Israel 
loses its scientific environment of free- 
dom and excellence, the renewal of its 
funding agreement with the European 
Union, for example, will be in jeop- 
ardy. This includes its participation in 
the Horizon Europe program as an as- 
sociated country, which was renewed 
in 2021 by the previous government. 
If cut off from such lucrative and pres- 
tigious programs, many Israeli scien- 
tists will likely seek a new academic 
home where they can be supported to 
conduct robust research as an integral 
part of the wider scientific community. 

In broader terms, increasing the 
religious orientation of elementary 
school and high school curricula, as the government is 
doing, will increase the possibility that science will no 
longer be at the forefront of primary education. This will 
have cascading effects, from a decrease in students pursu- 
ing scholarly studies to a decline in innovative and entre- 
preneurial opportunities in science and technology. 

Israel now finds itself at the vanguard of countries 
descending rapidly into a “hollow democracy” with 
weakened academia. To oppose this demise, the global 
academic community must unite and work together 
vigorously to resist attempts in Israel to undermine aca- 
demic freedom. Petitions from academics around the 
world, national and global academic and scientific soci- 
eties, and scientific policy organizations worldwide, and 
other peaceful means to raise awareness of the situation 
are needed immediately if scholarship and the pursuit of 
knowledge are to be safeguarded throughout the world. 


—Einat Albin, Shikma Bressler, Asya Rolls, 
Michal Schwartz, Ehud Shapiro 
CLIMATE POLICY


Climeworks, which operates this 
carbon capture and storage plant 
in Iceland, will help build a similar,
larger facility in Louisiana.


United States scales up carbon-removal plants 


The U.S. Department of Energy last week announced
it will spend $1.2 billion for two pioneering
facilities—one in Texas, the other in
Louisiana—that will vacuum millions of tons of
carbon dioxide (CO₂) annually from the skies using
a technology known as direct air capture (DAC).
The investment marks the first major governmental
backing in the world for the emerging carbon-capture
technology. The program aims to create four DAC hubs
over the next 10 years, each capable of removing and
storing at least 1 million tons of CO₂ annually. Today,
18 DAC plants worldwide collectively remove only about
11,000 tons of CO₂ a year. Critics have slammed DAC as a
boondoggle because it requires substantial inputs of energy
to capture CO₂ and pump it underground. Currently,
DAC typically costs more than $1000 to sequester each
ton of CO₂, far more expensive than planting trees or
other carbon-removal strategies. But DAC proponents
counter that its costs may fall. They add that the limitations
of other methods may require countries seeking to
limit climate warming to no more than a 1.5°C rise by
midcentury to use DAC.


Youths win Montana climate case 


LEGAL AFFAIRS | A Montana state judge 
ruled this week in favor of young people 
who argued that the state’s policies sup- 
porting fossil fuels violate their right under 
its constitution to a “clean and healthful 
environment.” The ruling invalidates a 
provision of a Montana law that prevented 
state officials from considering the climate 
impacts of energy projects. In June, the case, 
Held v. Montana, went to trial, the first of its 
kind in the United States. Plaintiffs ranging 
in age from 5 to 22 asserted that Montana’s 
policies created environmental damage 

that harmed them directly, such as wildfire
smoke that worsened asthma. Climate
scientists called as expert witnesses testified 
about Montana’s worsening natural disasters 
and high rate of atmospheric warming. The 
state is expected to appeal. The case was 
spearheaded by the nonprofit Our Children’s
Trust, which has related lawsuits pending in 
four other states. Similar victories have been 
won in European courts. 


Studies halted at N.Y. institute 


RESEARCH OVERSIGHT | A U.S. watchdog
office in June suspended 124 clinical trials 
underway at the New York State Psychiatric 
Institute, a facility affiliated with Columbia 
University. The unusual move by the Office 
for Human Research Protections (OHRP), 
first reported last week by The New York 
Times, came 2 weeks after the institute itself 
paused all 417 human-subject studies under- 
way there. The actions followed the suicide 
more than a year ago of a participant in the 
placebo arm of a study testing a Parkinson’s 
disease drug, levodopa, to treat depression
and reduced mobility; that study was
suspended in January 2022. Its lead investigator,
psychiatrist Bret Rutherford, resigned
from the institute on 1 June and is no longer 
on the Columbia faculty. Two journals have 
retracted papers related to Rutherford’s 
levodopa research, citing methodological 
problems. The institute said in a statement 
it is “working with its federal partners to 
create a research safety review plan” and
is awaiting OHRP’s approval to resume the
federally funded studies. 


China faces mpox surge 


INFECTIOUS DISEASES | China last week 
confirmed 491 cases of mpox in July, a 
monthly high and nearly five times June’s 
total. Cases of the infectious disease, 
formerly known as monkeypox, have also 
increased in other Asian countries after 
plummeting in the Americas and Europe. 
The disease is spread through intimate
personal contact and has particularly 
affected men who have sex with men; more 
than 96% of the infections in China were 
among this group, according to a 9 August 
announcement by the Chinese Center for 
Disease Control and Prevention. The agency 
reported no severe cases or deaths. Officials 
believe China’s strict COVID-19 restrictions 
on international travel, in place until this 
spring, kept the disease at bay during the
10 months starting in July 2022 when the
World Health Organization designated mpox 
a Public Health Emergency of International 
Concern. With no vaccine yet available 
domestically, China is relying on raising 
public awareness and changes in behavior to 
limit the outbreak. 


Muon magnetism mulled 


PARTICLE PHYSICS | Physicists at Fermi 
National Accelerator Laboratory last week 
released a new measurement of the magne- 
tism of a particle called the muon, a heavier, 
unstable cousin of the electron. With an 
uncertainty of 0.2 parts per million, the new
result confirms with twice the precision the 
one released 2 years ago. However, it has 
not elicited as much excitement as the 2021 
result, which appeared to slightly exceed the 
prediction of physicists’ prevailing standard 
model, possibly signaling undiscovered 
particles lurking in the vacuum around the 
muon. Theorists now realize the standard 
model’s prediction was less certain than 
thought, obscuring whether the muon is 
really extra-magnetic. They hope to make a 
more reliable prediction before experiment- 
ers release their final result in 2025. 


NIH reverses course on union 


LABOR RELATIONS | After questioning the 
legal standing of a unionization effort by 
early-career researchers, the U.S. National 
Institutes of Health (NIH) now says it won’t 
oppose a vote by potential union members. 
In a filing with the Federal Labor Relations 
Authority, NIH officials had contended that 
many potential members, including postdocs 
and graduate students, weren’t “employees” 
with a right to unionize. But they reversed 
that position last week. 


India creates research funder 


SCIENCE POLICY | India’s Parliament
last week approved a new research funding
agency aimed at boosting the nation’s
scientific standing. But some analysts
are skeptical that the Anusandhan
National Research Foundation will have a
major impact. (Anusandhan is Hindi for
“innovation.”) The foundation is rooted in a
2019 proposal to create an agency, modeled
after the U.S. National Science Foundation,
that would be funded by the government
while having an arm’s-length relationship
with elected officials (Science, 14 July, p. 118).
But the structure approved by lawmakers
gives India’s prime minister broad powers to
appoint the agency’s leaders. Critics say that
opens the door to political interference. And
some wonder whether the agency can realize
its funding plan, which calls for industry
to contribute about 70% of an envisioned
$6 billion outlay over 5 years.


6%

Share of U.S. survey respondents who
support requiring all advanced
artificial intelligence models to have
their safety evaluated by
independent experts. (Artificial
Intelligence Policy Institute)


China leads in internal citations 


PUBLISHING | More than 60% of recent 
citations of papers from Chinese institutions 
were made by China-based researchers, 
according to an analysis by Japan’s National 
Institute of Science and Technology Policy. 
The unusually high rate raises questions 
about whether intentional efforts to boost 
the reputation of Chinese institutions are
contributing to China’s large share of the 
most cited papers. The comparable figure 
for the United States was 29% and for other
countries less than 20%, according to the
analysis, which averaged data from 2019 
to 2021. China produced the biggest share 
(nearly 30%) of papers among the top 1% 
most highly cited, the report says, as well as 
the most scientific publications indexed in 
Clarivate’s Web of Science Core Collection. 
The United States was second in those 
categories. The analysis did not explain 
whether the citations by authors in China 
are relevant to the studies in which they 
appear or indicate padding. 


Deal aids red wolf conservation 


WILDLIFE | The U.S. Fish and Wildlife
Service (FWS) has agreed to continue to 
release captive-bred red wolves (Canis rufus) 
to boost the only wild population of this 
endangered species, reversing an earlier 
decision to stop the releases. The new policy 
resulted from a court settlement, announced 
last week, with conservation groups that had 
sued the agency. Although red wolves once 
lived in most of the southeastern United 
States, hunting and habitat loss caused a 
massive fall off that by 1972 drove the spe- 
cies nearly to extinction—unusual among 
recent declines of large North American 
mammals. Decades of captive breeding and 
releases lifted the wild population to only 
about 120 animals. Car accidents and poach- 
ing have taken a toll, and local landowners 
have objected to the releases. After FWS 
stopped them in 2015, the wild population 
fell to about 15 individuals. Now, the agency 
has made an 8-year pledge to release wolves 
under a restoration plan more thoroughly 
informed by research. Some supporters say 
the population’s recovery will require an 
additional measure—policing poachers. 


A red wolf and her pup are part of a captive population in North Carolina used to help increase the only wild one. 
Maui’s deadly blazes reveal a fire-prone Hawaii


Flammable, invasive grasses have changed the island landscape, say its shaken scientists 


By Molly Rains 


Scott Fisher was one of the lucky ones.
Last week, early on 8 August, the restoration
ecologist and his wife were
awoken in their home by cellphone
emergency alerts. Outside, on the Hawaiian
island of Maui, high winds buffeted
their house, shaking eucalyptus and
palm trees and scattering debris across the 
landscape. About 2.5 kilometers away, cut- 
ting a path through the darkness, a wildfire 
was burning through the dry forests of the 
island’s upcountry. 

Fisher and his wife escaped to safety, but 
as Science went to press, more than 100 peo- 
ple on Maui had been confirmed killed by 
multiple wildfires last week—and some offi- 
cials fear the true death toll is much higher. 
Maui’s tight-knit scientific community has 
begun to assess the disruptions to island re- 
search and conservation efforts, but all that 
was overshadowed by the loss of so many 
lives. “We're still wrapping our heads around 
what this really means, because right now, 
most of us are still in shock,” says Marc
Lammers of the Hawaiian Islands Hump- 
back Whale National Marine Sanctuary. 

The marine mammal ecologist and other 
scientists on Maui study whales from ships 
that are—or were—based in Lahaina, a his- 
toric town razed by the fires. Its harbor was 
“the hub of whaling science here in Hawaii,” 
Lammers says. Now, it is vacant, strewn 
with charred debris. 
Fisher, who grew up on Maui and works
for the Hawaii Land Trust, notes that geo- 
logical evidence suggests major fires were 
uncommon in Maui’s past. But they have 
become an increasingly recurrent—and 
worsening—threat in recent decades, es- 
pecially as nonnative, highly flammable 
grasses invade the island’s large, abandoned 
plantations and previously burned land- 
scapes. “Every fire it just gets worse,” says 
Fisher, who helped his neighbors evacuate 
to safer locations. 

Their exact origin unknown, last week’s 
fires were driven by a complex host of fac- 
tors, Fisher says, including a lengthy drought 
compounded by “exceptional” winds and low 
humidity. Add in Maui’s widespread invasive 
vegetation and it was a “perfect recipe for 
disaster,” says ecologist Carla D’Antonio of
the University of California, Santa Barbara, 
who has studied the fire potential of Ha- 
waii’s nonnative grasses for decades (Science, 
5 August 2022, p. 568). 

She and others call last week’s fires unprecedented.
“It’s never been this bad before,”
says oceanographer Andrea Kealoha,
a lifelong Maui resident and director of an 
ocean and groundwater laboratory on the 
island. Previous fires on Maui had been 
in relatively remote locations, she notes. 
Now, researchers wonder whether the 
blazes foreshadow a fiery future for it and 
other Hawaiian islands. The “big, big ques- 
tion on everybody’s mind,” Fisher says, is: 
“What can we do now?” 


Spread by fierce winds last week, the westernmost of 
Maui’s major wildfires destroyed the town of Lahaina 
and killed scores of people. 


From a scientific perspective, Kealoha is 
concerned that postfire soil erosion will re- 
lease sediment into the island’s waterways, 
and eventually the ocean, creating a murky 
mess that could profoundly disturb local 
aquatic ecosystems. The phenomenon “can 
shift an entire ecosystem away from corals 
to one dominated by algae,” she says. 

Future fires could further reshape Maui’s 
distinctive ecology, which conservationists 
were already struggling to save. The Maui 
Bird Conservation Center, run by the San 
Diego Zoo and home to many local species 
no longer, or rarely, found in the wild, was 
among the research facilities that barely 
escaped last week’s blazes. “Had the center 
burned, a good proportion of the [‘alala, 
an endangered Hawaiian crow] flock could 
have disappeared,” says Christa Seidl, a 
wildlife ecologist with the Maui Forest 
Bird Recovery Project. “That would ... po- 
tentially threaten that species with literal 
extinction—not just extinction in the wild.” 

The fires did not reach the University of 
Hawaii’s (UH’s) Olinda Rare Plant Facility, 
but high winds blew apart growing houses 
and some buildings, notes biologist Arthur 
Medeiros of the Auwahi Forest Restoration 
Project. “A phenomenal collection of propa- 
gated native endangered plants was heavily 
damaged,” he says. 
Shifting ecosystems across Hawaii are
making all the islands more vulnerable to 
wildfires, according to Dan Rubinoff, an in- 
vasive species biologist at UH. “It’s getting 
worse, probably due to climate change, in 
terms of us having longer dry periods,” he 
says. “But it’s also exacerbated by differences 
in land use and habitat.” 

European colonization of Hawaii 2 centu- 
ries ago accelerated introductions of invasive 
species. Many of the foreign plants, such as 
fountain grass (Pennisetum setaceum) and 
haole koa (Leucaena leucocephala), are fire- 
tolerant. They grow quickly, serving as fodder 
for blazes when they ignite and reestablishing 
rapidly on burned areas, crowding out native 
species. In particular, various species of Afri- 
can pasture grasses now dominate large ar- 
eas, D’Antonio says. Droughts can drive their 
desiccation, creating large amounts of ignit- 
able fuel. “This sort of calamity has been a 
long time coming,” she says. 

One reason these foreign grasses have 
room to expand, researchers say, is that 
Maui’s native plant life is voraciously con- 
sumed by the thousands of feral goats and 
other nonnative hoofed creatures (includ- 
ing deer, hogs, and sheep) that run wild in 
the island’s forests. Invasive species—both 
plant and animal—also pose a barrier to re- 
establishing native forests that scientists 
believe are essential to Maui’s environ- 
mental health and fire-resilience. Target- 
ing these invasives is just one piece of the 
puzzle, says Medeiros, who also calls for 
incorporating more fire-resistant vegeta- 
tion into the Maui landscape. “There is no 
magic bullet,” he says. 

Indeed, Fisher counterintuitively suggests 
goats and other grazing ungulates could 
be strategically deployed to keep invasive 
grasses in check. “I think getting these ani- 
mals to work in a defined area ... might be 
one” fire prevention strategy, he says. 

It won’t be easy for Maui to recover from 
the fires, which Hawaii’s governor has called 
the worst in the state’s history. Kealoha 
anticipates that it will take many years to 
rebuild, but she retains hope that Maui 
and its research community can rebound 
stronger. Already because of the tragedy, 
she says, “we’re working with more groups, 
building collaborations and relationships 
with people who have handled these kinds 
of disasters before.” 

In the meantime, many of Maui’s scien- 
tists are in mourning. Even this week, as the 
fires continued to smolder, many community 
members remained missing. “Our staff,” says
UH conservation biologist Shaya Honarvar, 
“are still in the midst of this tragedy.” 


With reporting by Tanvi Dutta Gupta and 
Elizabeth Pennisi. 
NIH project probes the human
body’s multitude of genomes 


Agency launches major effort to explore importance 
of accumulating mutations in somatic cells 


By Mitch Leslie 


Every person starts with just one genome,
the unique amalgam of paternal
and maternal DNA in the fertilized
egg. And researchers long thought that
over a lifetime, pretty much all of the
body’s diverse cells inherit that same
genome. But large-scale DNA sequencing 
over the past decade or so has toppled that 
view, showing that human DNA starts to ac- 
crue mutations early in embryonic develop- 
ment and continues to change throughout 
life. “The genome you are conceived with 
is very different from the genome you die 
with,” says cardiovascular biologist Kenneth 
Walsh of the University of Virginia. 

As a result, every person is actually a mo- 
saic of genomes, varying across the body and 
often within the same organ or tissue. These 
DNA changes introduce a diversity to the 
body’s somatic, or nonreproductive, cells that 
may be as important to health as the more 
pervasive alterations inherited from parents. 
Now, the National Institutes of Health (NIH) 
has launched a 5-year, $140 million project 
to map this universe of genomic diversity— 
and probe why it matters. 

Known as Somatic Mosaicism Across 
Human Tissues (SMaHT), the program will 
measure the baseline frequency of these 
mutations in an assortment of tissues to 
help researchers better understand how 
the alterations contribute to health and 
disease. SMaHT, which in May doled out 
its first 22 grants, aims to collect samples 
of 15 tissues from 150 healthy people who 
donated their bodies for research. It has 
funded five teams to sequence DNA from 
these samples—they should begin in the 
coming months—and is backing others to 
develop new technologies for analyzing 
genetic variants and probing their effects. 

“The idea is to be able to at least cata- 
log the mutations” so that researchers can 
delve into links with diseases, says genomi- 
cist Harsha Doddapaneni of Baylor Col- 
lege of Medicine, who helps lead one of the 
SMaHT sequencing groups. 

For decades, the conventional wisdom held 
that a person’s somatic cells could pick up 
mutations but that these genome alterations
were rare and not a major cause of health
problems. Mutations in the skin, for example, 
occasionally resulted in unusual pigment pat- 
terns such as port-wine stain birthmarks. 

But scientists now know that our ge- 
nomes are riddled with somatic mutations. 
Even in young children, some cells already 
carry thousands of these alterations, and 
one study found that lung cells from a for- 
mer smoker in her 70s boasted more than 
15,000 mutations each. “We used to think 
about the genome. Now, we think about our 
genomes,” says oncologist Dan Landau of 
Weill Cornell Medicine. 


The vast majority of these changes likely 
have no impact on our health. A portion can 
trigger cancers, however, and other muta- 
tions may drive different illnesses or cause 
premature deaths. Clonal hematopoiesis, 
a variety of mosaicism that affects blood- 
forming cells and becomes more common 
with age, almost doubles the likelihood 
of developing cardiovascular disease and 
boosts the risk of dying from any cause by 
40% (Science, 10 November 2017, p. 714). As 
men get older, they become more vulner- 
able to another type of mosaicism in which 
the Y chromosome vanishes from some of 
their cells. Its absence may set them up for 
ailments such as cardiovascular disease and 
macular degeneration. 

The brain can also incur damage as neu- 
rons and other cells accumulate mutations. 
“Somatic mutations are disease-causing in 
some proportion of patients with epilepsy,” 
says Alissa D’Gama, a clinical fellow at 
Boston Children’s Hospital. In addition, 
researchers estimate that fetal mutations 
that may modify brain development ac- 
count for about 3% to 5% of the risk of 
developing autism spectrum disorder. 
Studies have also linked mosaic brain mu- 
tations to a variety of other neurological 
disorders, including schizophrenia and Alzheimer’s 
disease. “Somatic mosaicism is 
likely playing a role” in these conditions, 
D’Gama says. “What needs to be teased out 
is how big a role.” 

Plenty of other mysteries about somatic 
mutations remain—for example, whether 
certain tissues accumulate more mutations 
than others. “Only a few types of tissues 
have been investigated,” notes developmental 
scientist Flora Vaccarino of Yale School 
of Medicine, who has SMaHT funding. 

Researchers also want to determine 
whether some somatic mutations benefit us. 
Scientists have found that individual cells 
can gain from certain changes that give them 
and their descendants, known as a clone, 
a competitive advantage 
over other clones. However, 
what’s good for specific cells 
isn’t necessarily good for 
the tissues they inhabit— 
cancer is a prime example— 
and whether any somatic 
changes improve our overall 
health is unclear. But some 
evidence hints they can. 
For example, scientists have 
found that clones carrying 
certain mutations have an 
advantage in people with fatty liver disease, 
a condition in which fat accumulates in the 
organ and can cause it to fail. Those findings 
raise the possibility that the alterations help 
the liver cope with disease. 

To test that idea, liver biologist Hao Zhu 
of the University of Texas Southwestern 
Medical Center and colleagues mimicked 
gene-disabling somatic mutations in mice 
with nonalcoholic steatohepatitis (NASH), a 
type of fatty liver disease that is increasingly 
common in people. The researchers deleted 
each of 63 genes linked to NASH from a sub- 
set of cells in the livers of the animals. Six 
months later, the team found, clones miss- 
ing some of the genes had the upper hand. 
Because these elite clones grew faster, Zhu 
and colleagues next asked what would hap- 
pen if one of them prevailed, expanding so 
much that it replaced its rivals. To find out, 
they deleted some of the genes throughout 
the rodents’ livers. For three of the genes, 
the mice gained protection against fatty 
liver disease, accumulating less fat and in- 
curring less tissue damage. Liver clones 
probably grow too slowly in people to re- 
verse fatty liver disease, Zhu says, but the 
discovery could point to new treatments. 

“The genome you are conceived with is very different from the genome you die with.” 
Kenneth Walsh, University of Virginia 

By providing the first bodywide refer- 
ence for somatic mutations, SMaHT will 
help scientists investigate their roles. How- 
ever, finding these mutations is challeng- 
ing. Researchers and clinical labs are adept 
at uncovering mutations in tumors, but 
those alterations are typically found in a 
large fraction of the abnormal cells. In con- 
trast, some somatic mutations occur in less 
than 1% of cells in a tissue. Today’s DNA- 
decoding technology can miss such rare mu- 
tations because it has a relatively high error 
rate. Moreover, researchers often sequence 
DNA from multiple cells simultaneously, 
which can swamp rare changes. “You are 
looking for a signal that is hidden among 
noise,” says geneticist Martin Breuss of the 
University of Colorado School of Medicine. 

For SMaHT, five groups of research- 
ers plan to use several techniques to fer- 
ret out and verify these hidden signals. 
Doddapaneni, his Baylor colleague Rui Chen, 
and their team, for example, will deploy not 
just conventional whole-genome 
DNA sequencing, 
but also two types of RNA 
sequencing, which can help 
confirm some variants and 
identify the types of cells 
carrying them. To weed out 
errors, they will also use du- 
plex sequencing, a less com- 
mon technique that decodes 
both strands of DNA’s double 
helix to reduce mistakes and 
pinpoint rare mutations. 
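
The logic behind checking both strands can be illustrated with a toy sketch. It assumes, purely for illustration, that the two strands of one DNA molecule have already been read and aligned to the same reference coordinates; the function names and example sequences are hypothetical, and real duplex sequencing pipelines involve molecular tags and many reads per molecule, which are not modeled here.

```python
from typing import List, Optional, Tuple

# Watson-Crick base pairing: each base on one strand pairs with its complement.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def duplex_consensus(top_base: str, bottom_base: str) -> Optional[str]:
    """Return the top-strand base if the bottom strand carries its complement.

    Returns None when the strands disagree, which in this toy model is
    treated as a likely sequencing error on one strand.
    """
    if COMPLEMENT.get(bottom_base) == top_base:
        return top_base
    return None


def call_variants(reference: str, top_read: str,
                  bottom_read: str) -> List[Tuple[int, str, str]]:
    """List (position, reference base, observed base) for changes seen on both strands.

    Assumption: bottom_read is given position by position as the complement
    of the top strand (not reversed), to keep the sketch short.
    """
    variants = []
    for pos, (ref, top, bottom) in enumerate(zip(reference, top_read, bottom_read)):
        base = duplex_consensus(top, bottom)
        if base is not None and base != ref:
            variants.append((pos, ref, base))
    return variants


if __name__ == "__main__":
    reference = "ACGTAC"
    top_read = "ACTTCC"     # position 2 changed on both strands; position 4 only here
    bottom_read = "TGAATG"  # complement of ACTTAC, so it supports only position 2
    print(call_variants(reference, top_read, bottom_read))
```

In this example the change at position 2 appears on both strands and is reported, while the change at position 4 appears on only one strand and is discarded as a probable error.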

The remaining SMaHT-funded teams will 
tackle a variety of projects. Geneticist Kath- 
leen Burns of the Dana-Farber Cancer Insti- 
tute and her colleagues want to better define 
the role of transposons, lengths of DNA that 
modify the genome by moving from place to 
place. The researchers will develop ways to 
identify restless elements that are primed to 
relocate and pin down where in the genome 
they insert. Work by other groups will in- 
clude probing how somatic mutations affect 
gene activity and developing new approaches 
for sequencing the genomes of single cells. 

SMaHT won’t answer every question 
about somatic mutations. Walsh notes that 
although the program will obtain tissues 
from people of different ages, it won’t in- 
clude samples taken over time from the same 
person, making it harder to understand the 
mutations’ role in aging, he says. But it is a 
key next step in what Landau calls “a huge 
revolution in human genetics.” He is eager to 
see the results. “We are just at the beginning 
of this incredible adventure.” 


ASTRONOMY 

After string of failures, Japan aims to launch x-ray telescope 

Groundbreaking XRISM will capture spectra, revealing the motion and composition of million-degree gases 

By Daniel Clery 

The first one failed to reach orbit. The 
second died soon after getting to 
space, when its helium coolant was 
accidentally dumped. The third one 
lasted for 37 days before its spacecraft 
broke apart in a fatal spin. 

The Japan Aerospace Exploration Agency 
(JAXA) is hoping the fourth time is the 
charm for a revolutionary x-ray instrument 
that will give astronomers unprecedented 
views of the hot gases around supernovae 
and black holes and within galaxy clusters. 
On 26 August, the agency plans to launch the 
X-Ray Imaging and Spectroscopy Mission 
(XRISM), a telescope fitted with a NASA- 
developed device to do something that has 
long been a challenge for x-ray telescopes: 
Tease apart the x-rays’ wavelengths, the way 
a prism splits visible light. Detailed x-ray 
spectroscopy will allow researchers to not 
only see the hot gases, but also discover what 
they're made of and how they’re moving. 
“It’s a completely new kind of detector,” says 
astrophysicist Poshak Gandhi of the Univer- 
sity of Southampton. 

Because Earth’s atmosphere blocks x-rays, 
astronomers must go to space to see them. 
Even there, making x-ray images is a chal- 
lenge. Because x-rays go straight through 
conventional mirrors, the photons must be 
gathered and focused by glancing reflections 
off nested cylindrical mirrors. X-ray tele- 
scopes using that approach can image the hot 
gases, which make up more than half the vis- 
ible matter in the universe. But astronomers 
want more, Gandhi says. “We really need to 
be able to distinguish the different colors of 
x-ray light.” 

Regular spectrometers struggle with x- 
rays, especially high energy photons from 
extended sources. But in the 1990s, engi- 
neers at NASA’s Goddard Space Flight Cen- 
ter developed a chip-based sensor called a 
microcalorimeter, which can measure the 
energy—closely related to wavelength—of 
individual x-ray photons. When an x-ray 
strikes one of the mercury telluride pixels in 
the calorimeter, it knocks loose an electron 
and transfers all its energy to it. The electron 
bounces around the pixel, raising its tem- 
perature by a tiny fraction of a degree and 
warming an adjacent temperature sensor. To 
register these tiny quantities of heat, which 
indicate the energy of the original photon, 
the whole device must be cooled to 1/20 of a 
degree above absolute zero. 
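
The link between a photon’s energy and its wavelength is the standard relation E = hc/λ, so a measured energy can be converted directly into a wavelength. The short sketch below shows that conversion. The function names are illustrative, and the 6.4 keV example energy (roughly that of an iron emission line) is an assumption chosen for the demonstration, not a figure from the mission.

```python
# Planck constant times the speed of light, expressed in eV * nm (rounded).
HC_EV_NM = 1239.841984


def photon_wavelength_nm(energy_ev: float) -> float:
    """Return the wavelength in nanometers of a photon with the given energy in eV."""
    if energy_ev <= 0:
        raise ValueError("photon energy must be positive")
    return HC_EV_NM / energy_ev


def photon_energy_ev(wavelength_nm: float) -> float:
    """Return the energy in eV of a photon with the given wavelength in nanometers."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    return HC_EV_NM / wavelength_nm


if __name__ == "__main__":
    energy_ev = 6400.0  # illustrative x-ray photon energy (6.4 keV)
    wavelength = photon_wavelength_nm(energy_ev)
    print(f"{energy_ev:.0f} eV corresponds to about {wavelength:.4f} nm")
```

Because the relation is a simple inverse proportion, higher-energy photons have shorter wavelengths, which is why measuring each photon’s energy amounts to sorting x-rays by color.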

Astronomers got a taste of a micro- 
calorimeter’s capability when one flew aboard 
the ill-fated Hitomi telescope, JAXA’s third try. 
Before the 2016 mission broke up in the fatal 
spin, it made groundbreaking observations 
of the Perseus galaxy cluster and a handful 
of other objects. “We glimpsed the prom- 
ised land, but could not go in,” says Brian 
Williams, NASA’s project scientist for XRISM. 
In those few weeks, Hitomi did “transforma- 
tional science with one single pointing,” says 
Elisa Costantini of the Netherlands Institute 
for Space Research, principal investigator for 
the Swiss-Dutch contributions to XRISM. “It 
proved how necessary it was.” 

The Perseus cluster is one of the most 
massive objects in the universe, a conglom- 
eration of thousands of galaxies swimming 
in a sea of gas heated to 50 million K. With 
Hitomi’s microcalorimeter, researchers were 
able to see unprecedented details in the 
gas’ x-ray glow. They detected spikes in its 
spectrum that revealed specific elements, 
such as iron, indicating what types of super- 
novae had spewed their heavy elements into 
space. To their surprise, the chemical recipe 
was very similar to that of the Sun. It looked 
“strangely familiar,” Costantini says. 

Researchers also saw some of the spikes 
were smeared out by the motions of the 
gas. But not by much: The cluster’s gas was 
surprisingly quiescent, not the maelstrom 
theorists had predicted. One of XRISM’s 
first tasks will be to look at other clusters 
to see whether Perseus is an oddity or the 
norm. “That’s the homework left by Hitomi,” 
says Makoto Tashiro of Saitama University, 
XRISM’s principal investigator. 

In addition to galaxy clusters, XRISM will 
study the hot gases swirling around super- 
novae remnants and black holes, both the 
supermassive ones in galactic centers and 
stellar mass ones that suck material from 
companion stars. “It’s dead easy to find 
black holes in the universe with an x-ray 
telescope,” Gandhi says. 

What isn’t so easy is knowing how swirl- 
ing matter moves and whether some of it is 
blasted outward into the surrounding gal- 
axy, affecting star formation and the galaxy’s 
evolution. Some of the outflows are known 
to travel at hundreds of kilometers per sec- 
ond, but there are hints of winds blowing 
hundreds of times faster, which would have a 
profound effect on the host galaxy. “There are 
many things we don’t know,” Costantini says. 

Figure: A sector of one of XRISM’s mirrors, which focus x-rays by glancing reflection. 

Figure: Japan’s X-Ray Imaging and Spectroscopy Mission telescope will open a new eye on the x-ray universe, helped by a long-awaited NASA sensor. 

JAXA has worked hard to ensure the 
$190 million XRISM doesn’t succumb to the 
fates of its predecessors. It has a revamped 
attitude control system, redesigned coolant 
pipework, and a backup mechanical cooler 
that could allow it to operate after its helium 
runs out in 3 years. Although XRISM has fewer 
other instruments than Hitomi, “spectro- 
scopy is the strongest point, [and] that can 
extend the science,” Tashiro says. 

Hopes for XRISM are brightening an 
otherwise difficult time for x-ray astron- 
omy. The field relies heavily on two flagship 
missions—NASA’s Chandra X-ray Obser- 
vatory and XMM-Newton from the Euro- 
pean Space Agency (ESA)—that are both 
more than 20 years old, long past their 
design lifetimes. “Chandra’s health is very 
good overall,” says Pat Slane, director of 
the Chandra X-ray Center. But, he says, 
deposits on its filters are reducing sensi- 
tivity and its shiny thermal shielding is 
degrading, leading to overheating. As for 
XMM-Newton, its detectors are aging, says 
Norbert Schartel, ESA’s principal investi- 
gator for the mission, but it “can still do 
science.” The ESA team hopes to eke out 
enough coolant to last until 2030. 

If either telescope dies early, “it will be a 
loss to the field,” says Jiachen Jiang of the 
University of Cambridge. Replacements won’t 
come until the mid-2030s at the earliest. 

That raises the stakes for a successful 
launch even higher. XRISM isn’t a general- 
purpose machine like Chandra and XMM- 
Newton. But, Williams says, it “will be the 
predominant x-ray mission of the 2020s.” 

Myth of Alaskan city’s immunity to tsunamis dispelled 

Lucky low tide helped Anchorage dodge catastrophic 1964 tsunami, first analysis of location’s risk finds 

By Christian Elliott 

On a cold March evening in 1964, a co- 
lossal earthquake struck off the coast 
of Alaska. At magnitude 9.2, it was the 
largest earthquake ever recorded in 
North America, and it triggered mas- 
sive tsunamis that killed more than 
120 people and leveled communities. But no 
wave reached Anchorage, the state’s biggest 
city. Many concluded that nearby geography 
makes the city immune to tsunamis. 

A new study published this week by the 
Alaska Division of Geological and Geo- 
physical Surveys (DGGS), however, finds 
Anchorage simply got lucky in 1964—and 
might not the next time an earthquake 
strikes the seismically active region. In a 
worst-case scenario—another giant rup- 
ture in a worse place at a worse time—a 
10-meter wave could flood low-lying areas 
of the city and its critical port for more 
than 24 hours, according to the study. 

“I think folks in Alaska should take this 
seriously,” said Diego Melgar, a tsunami 
scientist at the University of Oregon not 
involved in the study. He says other U.S. 
coastal communities would benefit from 
such an analysis. “This is exactly what we 
need to be doing at a national level.” 

An entrenched public perception holds 
that Anchorage is protected from tsuna- 
mis by the long, shallow Cook Inlet. False 
alarms have reinforced the sense of invul- 
nerability: In recent years, residents have 
dutifully heeded tsunami warnings and 
evacuated, but no wave has struck. After a 
2018 evacuation, an Anchorage Daily News 
story quoted an oceanographer who said, 
“The science doesn’t lend itself to an inun- 
dation in Anchorage.” 

The truth was no one had ever really ex- 
amined the city’s tsunami risk in detail. So 
Barrett Salisbury, DGGS earthquake and 
tsunami hazard program manager, and 
Elena Suleimani, the Alaska Earthquake 
Center’s tsunami modeler, recently took a 
closer look. They began by simulating the 
1964 earthquake, which occurred when the 
Pacific tectonic plate, which plunges un- 
derneath Alaska, ruptured by as much as 
20 meters. Their tsunami model traced the 
resulting wave toward Anchorage through 
the long Cook Inlet, which has uncom- 
monly large tides. They discovered that 
when the 3-meter-tall wave reached An- 
chorage, hours after the earthquake, the 
city happened to be at low tide, 4 meters 
below the mean. The low tide simply muf- 
fled the tsunami. With a different earth- 
quake timing, the inlet’s tides could have 
just as easily amplified it. 

Concerned, the scientists looked to the 
future, modeling hypothetical earthquakes 
at different locations and times of day, 
based on the earthquake zone’s behavior in 
several thousand years of paleoseismic re- 
cords. In the worst scenario—a magnitude 
9.2 rupture striking farther south than the 
1964 earthquake and triggering a tsunami 
that arrives at high tide—a 10-meter-tall 
[…] 
to take it seriously, even though it’s a low- 
probability event,” Salisbury says. 

Figure: Waves of destruction. Low tides in the Cook Inlet muffled the tsunami from the 1964 Alaskan earthquake. But if another giant rupture occurred at high tide, Anchorage could be swamped with 10 meters of water, according to a new hazard assessment. Map labels: Alaska 1964 magnitude 9.2 earthquake rupture zone; Anchorage; Kenai Peninsula; Cook Inlet; Gulf of Alaska. 

Salisbury and Suleimani found most res- 
idential areas would be out of harm’s way, 
but the Port of Alaska, which processes 
most of the state’s food, fuel, and consumer 
goods, would be inundated. “It’s kind of 
shocking how much we depend on the Port 
of Alaska,” Suleimani says. “We don’t have 
a backup.” The wave would also flood the 
critical single highway connecting Anchor- 
age to Canada and the rest of the continen- 
tal United States. 

Jim Jager, Port of Alaska deputy direc- 
tor, remembers hosting tsunami scientists 
at his docks years ago and being reassured 
the port was safe. He admits he perpetu- 
ated the idea, asking the state for funding 
to modernize the port’s infrastructure be- 
cause unlike others elsewhere, his couldn’t 
be wiped out by a tsunami. Briefed on the 
DGGS report, Jager now plans to fortify the 
docks and install backup batteries above 
the inundation zone so that supplies could 
be unloaded even amid a blackout. “If the 
docks get wiped out, how are we going to 
get food?” he asks. “If you look at any [emer- 
gency] plan for responding to and recover- 
ing from an earthquake, as soon as the Port 
of Alaska is inoperable, the plan collapses.” 

Hermann Fritz, a tsunami expert at the 
Georgia Institute of Technology, thinks the 
study’s models are solid. But he says the 
study doesn’t account for another kind 
of tsunami: secondary ones that arise 
when earthquakes shake loose land- 
slides that crash into the sea and 
Alaska’s many narrow fjords. For 
example, he says, a 1958 earthquake 
in southeastern Alaska triggered the 
Lituya Bay landslide, which created a 
524-meter-tall wave—the largest ever re- 
corded on Earth. With tsunamis, “there are 
always surprises,” Fritz says. 

With the report out, the next step is 
educating the public about the newly rec- 
ognized hazard. At the front lines will be 
Anchorage Office of Emergency Manage- 
ment Programs Manager Audrey Gray. 
She’s glad to have a study backing up her 
gut feeling that the city is not immune to 
tsunamis. Now, her office is updating emer- 
gency operation plans and has launched a 
joint information system to unify messag- 
ing across government agencies. A series of 
public meetings will begin in September. 
“The saying is all roads lead to Anchorage 
up here,” she says. “And we’ve got a road 
ahead of us, I’ll definitely tell you that.” 

Christian Elliott is a freelance science journalist and 
an audio producer at NASA’s Goddard Space 

SCIENCE POLICY 

Israeli scientists speak out against ‘destructive’ policies 

Many fear erosion of academic freedom and loss of talent 

By Michele Chabin, in Jerusalem 

Until recently, Elena Itskovich, an Israeli 
stem cell biologist who completed a 
postdoc at Stanford University 2 years 
ago, was planning a return to her home 
nation. But Itskovich says she’s now “on 
the fence.” She is uneasy about the poli- 
cies of the Israeli government elected nearly 
8 months ago and largely led by conservative 
nationalists and ultra-Orthodox parties. 

She is not alone in her concerns. Israeli 
researchers have become increasingly vo- 
cal in opposing policies they say threaten 
academic freedom and Israel’s standing as 
a leader in science and technology (see re- 
lated Editorial, p. 715). The government’s 
plans, which include downplaying science 
education and eliminating support for some 
Arab students, are “destructive” and could 
do “irreversible” harm to Israel’s science 
and high-tech sectors, the heads of Israel’s 
public research universities and members 
of a top government science advisory panel 
warned in a letter earlier this month. 

Many Israeli researchers “have feelings of 
apprehension and danger regarding their fu- 
ture,” declared the letter to Prime Minister 
Benjamin Netanyahu and his science and 
education ministers. The government’s poli- 
cies have already prompted some donors and 
investors to pause funding for R&D projects 
in Israel, and some foreign collaborators 
have issued “explicit threats” to cancel joint 
projects with Israeli institutions, members of 
the Council of University Heads and the Na- 
tional Council for Civilian Research and De- 
velopment noted. There is also “a significant 
decrease in the willingness of leading Israeli 
scientists who are abroad to accept academic 
positions in Israel,” they wrote. They also 
flagged signs that top researchers in Israel 
are eyeing moves abroad. “Many [Israeli sci- 
entists] are losing confidence and prefer to 
abandon ship,” the authors wrote. 

Historically, academic scientists in Israel 
have been reluctant to air their political 
views in a highly polarized society, says Uri 
Sivan, president of the Technion Israel Insti- 
tute of Technology. “It is indeed unusual for 
us to step in and speak up.” 

But the country has been in turmoil since 
January, when hundreds of thousands of Is- 
raelis began taking to the streets to protest 
what they call “undemocratic” efforts by 
Netanyahu’s government to limit the power 
of Israel’s Supreme Court. Now, many scien- 
tists are attending—even leading—street pro- 
tests around the country. “The motivation is 
the fact that the future of Israel’s academic 
institutions is being jeopardized,” Sivan says. 

In May, for example, the government de- 
cided to increase funding to ultra-Orthodox 
boys’ schools that offer little or no instruction 
in science (Science, 7 July, p. 17). This month, 
it announced a plan to reallocate tens of mil- 
lions of dollars long used to enable Arab high 
school students from East Jerusalem to gain 
admission to the Hebrew University of Jeru- 
salem. The money would instead be used for 
job training, said Finance Minister Bezalel 
Smotrich, who has claimed that “Islamic rad- 
ical cells” have organized on Israeli campuses 
under the guise of academic freedom. 

Photo caption: … in Tel Aviv, Israel, bears a sign reading “Female lab workers are struggling for the future of science.” 

Hebrew University said Smotrich’s de- 
cision to cancel funding for its program, 
which has served thousands of Arab stu- 
dents, would damage Israeli society and its 
economy for decades to come. 

Concern that the government could erode 
academic freedoms “isn’t theoretical,” says 
Rivka Carmi, a geneticist and former president 
of Ben-Gurion University of the Negev. “There 
has already been political interference.” 

Carmi and other researchers fear ultra- 
conservative members of the governing 
coalition will try to replace the scientists, 
physicians, and educators who have tradi- 
tionally filled some key posts with less qual- 
ified political appointees. They note that 
right-wing groups have already compiled 
blacklists of academics they consider politi- 
cally unacceptable. 

Tech sector executives are also worried 
about the toll the conflict is taking on what 
some Israeli analysts proudly call “the startup 
nation.” Investment in new Israeli tech firms 
has dropped 29% since the current govern- 
ment took office in late December 2022, com- 
pared with the previous 6 months, according 
to a report issued this month by Start-Up Na- 
tion Central, a nonprofit that promotes inno- 
vation. Initial public offerings of stock in new 
companies, as well as company mergers and 
acquisitions, have dropped to the lowest rates 
in years, the organization said. 

The drop-off will likely have repercussions 
for academic researchers, Sivan says. “We 
are very tightly connected to the high-tech 
industry. We are an ecosystem. You cannot 
have cutting-edge engineering schools when 
the high-tech sector is in trouble.” 

Still, Sivan is optimistic universities will 
ride out the current storm. “Israeli aca- 
demia is very strong,” he says. “The roots are 
deep.” Asya Rolls, a psychoneuroimmuno- 
logist at Technion, says the willingness of 
her fellow academics to protest the govern- 
ment’s actions leaves her feeling “hopeful, 
almost optimistic.” 

In the meantime, Itskovich and her hus- 
band are watching and waiting. “We still 
want to return and hope things will turn 
around in Israel,” she says. “But we have 
three children and don’t want them raised in 
a country that’s so broken apart.” 


Michele Chabin is a journalist in Israel. 


18 AUGUST 2023 » VOL 381 ISSUE 6659 723 




DEATH BY FIRE 


Wildfires, intensified by climate change and perhaps human activity, may 
have doomed Southern California’s big mammals 13,000 years ago 


By Michael Price, at the La Brea Tar Pits and Museum in Los Angeles 


Crouching beside a shallow pond in 
a haze of smoke, a saber-toothed 
cat fixes hungry eyes on a horse 
sipping a few meters away. Prey 
has been hard to come by: Wild- 
fires have burnt through the brush, 
and bison and other grazers have 
all but disappeared. As the big cat 
pounces, its quarry bolts forward 
into a tarry pool. Predator and prey alike
sink hopelessly into the muck known today 
as the La Brea Tar Pits. 

Some 13,000 years later, that saber- 
toothed cat’s jawbone sits in a museum 
drawer alongside those of a western horse, 
ancient bison, dire wolf, ground sloth, and 
yesterday’s camel. Collectively, the denizens 
of the “last drawer,” as researchers call it,
represent the last of their kind in the sticky,
fossil-packed asphalt pits surrounded by 
Los Angeles high rises. 

Paleontologists have long tried to under- 
stand why once-numerous populations of 
these and other megafauna vanished across 
North America toward the end of the last 
ice age. A study published on p. 746 of this 
issue of Science points to a new catalyst that 
ties together the two leading hypotheses:
human activity and climate change. Each 
played a role, but fire was the key mediator, 
the authors argue. In their scenario, when 
the climate suddenly became warmer and 
drier toward the end of the last ice age, 
human-caused blazes grew out of control, 
permanently altering the landscape—and 
spelling the end for the animals. 

The new study, which precisely dated 
more than 170 fossil bones and found a 
30-fold increase in wildfire near the pits, is 
one of the first to focus on fire as a force 
behind the megafauna’s demise. “I really 
like that this is getting away from the di- 
chotomous arguments we've had forever 
about megafauna extinction in the Ameri- 
cas, that it’s an either-or choice between 
climate- or human-caused,” says Angela 
Perri, a zooarchaeologist at Texas A&M Uni- 
versity who wasn’t involved in the study. 
“The most likely scenario is a combination.” 

Some others agree. “Quite believable,”
says Jenny McGuire, a paleoecologist at the
Georgia Institute of Technology who wasn’t
involved with the work. “This is one of those
cases where we do see the smoking gun.”

Emily Lindsey (left) and Regan Dunn display a
fraction of the unlucky saber-toothed cats,
dire wolves, sloths, camels, and other mammals
entombed in La Brea’s tar pits.

However, there’s no direct evidence for 
the human part of this equation, cautions 
independent consulting archaeologist Joe 
Watkins, past president of the Society for 
American Archaeology. “There is no real 
way to specifically link [early] humans 
to the increase in fires. ... There are alter- 
nate possibilities.” 

But Watkins and others agree that the 
team’s scenario serves as a warning about 
trends playing out across much of the world 
today: Landscapes are filling with people, cli- 
mate is changing, and wildfires are scorching 
ecosystems. “We're going into a period that’s 
warmer and drier,” says Brian Codding, an
anthropologist at the University of Utah who 
wasn’t involved in the study. “Some analogs 
[in the new study] help us understand that 
what we're going into is very likely another 
state change in the ecosystem.” 


FOR MOST OF THE PAST 2.5 million years, 
Southern California was a far cry from the 
sun-kissed chaparral made famous by Hol- 
lywood. Juniper and oak trees grew thick 
among its hills and valleys, interspersed 
with open range. Saber-toothed cats, Ameri- 
can lions, and massive dire wolves preyed 
on mammoths, mastodons, horses, and 
other herbivores—a menagerie recorded 
in fossils extracted from the La Brea tar. 
At some point between 21,000 and 16,000 
years ago, humans voyaged into the region, 
probably traveling along the coast. 

For perhaps a few thousand more years, 
the pollen and fossil records suggest things 
carried on more or less as usual, with mam- 
mals remaining abundant. Then, some- 
time between 10,000 and 13,000 years ago, 
something went terribly wrong, says Emily 
Lindsey, a paleoecologist and excavation 
leader at the La Brea Tar Pits and Museum. 
Verdant woods gave way to scrubby brush- 
land and desert; most of the large mam- 
mals disappeared. Similar scenes played 
out across North America as the world tran- 
sitioned out of the periodic ice ages of the 
Pleistocene and into the warmer Holocene. 
Continentwide, about 80% of megafauna 
went extinct around this time. For decades, 
scientists have tried to find out why. 

Some researchers pin the extinction on 
the global climate shifts. Starting about 
14,000 years ago, the Bølling-Allerød warming
period, marking the beginning of the
end of the last glaciation, sent temperatures
spiking and ponds drying for about
1800 years. Then the ice age had one last 
gasp, a 1000-year cold spell known as the 
Younger Dryas, before the climate warmed 
again about 10,000 years ago and settled 
into conditions similar to today’s. By dis- 
rupting ecosystems, the whipsawing cli- 
mate drove large mammals to extinction, 
some researchers argue. 

Others point to the suspicious timing of 
humans entering North America. Equipped 
with stone weaponry, the immigrants de- 
pleted the large herbivores, dooming their 
predators as well, the overkill argument goes. 

But many have questioned humans’ im- 
pact. For starters, although archaeologists 
once thought humans first entered North 
America about 13,000 years ago, around 
the time of the extinctions, newer evidence 
strongly suggests people arrived by at least 
16,000 years ago, and possibly thousands 
of years earlier. So humans had been living 
and hunting on the continent for thousands 
of years before the mass extinction began. 

F. Robin O’Keefe, a biologist at Marshall 
University and first author on the new 
study, says in the past he had been con- 
vinced by the overkill hypothesis. “This 
kind of 20th century masculine ‘we're going 
to hunt them to extinction’ kind of thing,” 
he recalls. Then he saw all the data showing 
people had coexisted with megafauna for 
thousands of years. “I had to unlearn this 
idea that it was going to be humans’ fault,” 
he says. “It’s more nuanced than that.” 

Settling this thorny debate requires 
knowing exactly when numerous species 
disappeared and how the environment 
changed as they did so. But the available 
data were spotty and ambiguous. 

In the early 2000s, a group of research- 
ers connected to La Brea realized their site 
could help unlock this mystery. For tens of 
thousands of years, animals have strayed 
into the shallow pools covering the site’s 
natural asphalt seeps. If unlucky creatures 
ventured a few steps in, however, they 
sank into the sticky mire beneath the sur- 
face. Hungry predators—sometimes whole 
packs—apparently chased after the panick- 
ing animals, turning the pits into carnivore 
traps. Over time, the muck entombed mil- 
lions of animals. 

Today, the museum and urban park is 
home to the world’s largest collection of 
Pleistocene fossils, containing more than 
3.5 million specimens from 60 species of 
mammals alone. That makes it the perfect 
place to pin down the dates for many spe- 
cies’ disappearances, O’Keefe says. “The at- 
traction is numbers,” he says. 

In 2018, he and colleagues started on an 
ambitious plan to radiocarbon date bones 
from a tar pit first excavated in the 1920s. 
The technique relies on the fact that living
things absorb tiny amounts of radioactive 
carbon-14 into their tissue. After death, the 
carbon-14 slowly decays into other isotopes 
at a predictable rate, allowing scientists to 
estimate a sample’s age to within a few de- 
cades. But the asphalt of a tar pit is rich 
in very old carbon, which is largely devoid 
of carbon-14. If carbon from the asphalt 
makes its way into a sample being dated, 
the results are useless. 
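
To see why even a little asphalt matters, the decay relationship can be sketched numerically. The snippet below is an illustration only, not the team's procedure. It uses the conventional radiocarbon age formula (age = -8033 x ln of the fraction of modern carbon-14 remaining) and assumes, hypothetically, that some share of a sample's carbon comes from carbon-14-free asphalt. The function names and the 10% contamination figure are assumptions chosen for the example, and the ages are uncalibrated.

```python
import math

# Conventional mean life used for radiocarbon ages (Libby half-life 5568 y / ln 2).
LIBBY_MEAN_LIFE_YEARS = 8033.0


def radiocarbon_age(fraction_modern: float) -> float:
    """Conventional (uncalibrated) age in years from the fraction of
    modern carbon-14 still present in the sample."""
    if not 0.0 < fraction_modern <= 1.0:
        raise ValueError("fraction_modern must be in (0, 1]")
    return -LIBBY_MEAN_LIFE_YEARS * math.log(fraction_modern)


def contaminated_age(true_age_years: float, dead_carbon_fraction: float) -> float:
    """Apparent age when a fraction of the sample's carbon is carbon-14-free
    (hypothetical stand-in for asphalt carbon left in the sample)."""
    if not 0.0 <= dead_carbon_fraction < 1.0:
        raise ValueError("dead_carbon_fraction must be in [0, 1)")
    true_fraction = math.exp(-true_age_years / LIBBY_MEAN_LIFE_YEARS)
    # Dead carbon dilutes the carbon-14 signal without adding any of its own.
    measured_fraction = (1.0 - dead_carbon_fraction) * true_fraction
    return radiocarbon_age(measured_fraction)


if __name__ == "__main__":
    clean = contaminated_age(13_000.0, 0.0)
    dirty = contaminated_age(13_000.0, 0.10)  # assumed 10% asphalt carbon
    print(f"clean sample: {clean:.0f} y, contaminated: {dirty:.0f} y")
```

Under these assumptions, 10% residual asphalt carbon makes a 13,000-year-old bone look roughly 850 years older, which is larger than the few-hundred-year steps between the extinction dates reported here.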

Using a combination of sonic waves, 
chemical solvents, and molecular filters, 
Lindsey and colleagues at the University 
of California (UC), Irvine, developed a new 
method to painstakingly purge the asphalt 
from their samples and isolate the collagen 
within. Radiocarbon dates from La Brea are 
“always tricky,” McGuire notes. “But this
paper definitely has the world’s experts in 
doing that.” 

The team dated 169 specimens from the 
pit’s eight most common species: saber- 
toothed cats (Smilodon fatalis), dire wolves 
(Aenocyon dirus), coyotes (Canis latrans), 
American lions (Panthera atrox), ancient 
bison (Bison antiquus), western horses 
(Equus occidentalis), Harlan’s ground sloths 
(Paramylodon harlani), and yesterday’s 
camels (Camelops hesternus). “They’ve 
probably doubled the number of radio- 
carbon dates for Late Pleistocene mega- 
fauna in North America,” Perri says. “That 
alone is a significant contribution.” 

The results reveal that at La Brea, 
sloths and camels disappeared first, about 
13,600 years ago. Then 13,200 years ago, 
other mammals began to drop off precipi- 
tously, with herbivores vanishing more 
quickly than carnivores. By 12,900 years 
ago, all of the megafauna in the study— 
save coyotes—were gone. This unusu- 
ally detailed chart of an extinction, with 
vegetation-dependent herbivores dying 
off first, set researchers to exploring how 
plant life changed at the time. 

Museum paleobotanist Regan Dunn ex- 
amined pollen grains from a well-dated 
core sample taken from the bottom of Lake 
Elsinore, 100 kilometers southeast of La 
Brea. The mix of pollen species changed lit- 
tle at 13,600 years ago, but between 13,200 
and 12,900 years ago, pollen counts for 
both oak and juniper plunged, and pines, 
grasses, and chaparral plants spiked (see 
timeline, right). 

Dunn and colleagues suspected fires were 
the culprit, because the plants that grew 
back are fire adapted and thrive in fire- 
prone areas. But they couldn’t prove it. 

Then Dunn stumbled across work by
Lisa Martinez, a Ph.D. student at UC Los
Angeles. Martinez had sorted through
the same Lake Elsinore core sample—
including at the key time period—for flecks
of charcoal that could reveal the region’s
history of wildfire. “I saw that and just
about fell out of my chair,” Dunn says.
Martinez soon joined the project.

Charting California’s
combustible past

For millions of years, Southern California was
covered in woodland, where large mammals such as
saber-toothed cats, dire wolves, and camels roamed.
New radiocarbon dates from the La Brea Tar Pits
suggest these big animals began to disappear about
13,200 years ago, just when charcoal and pollen from
a nearby lake suggest rampant wildfires.

[Timeline graphic; surviving label text: “Vegetation shifts: Most megafauna have disappeared, and a chaparral landscape ...”]

Her results matched the pollen and ex- 
tinction records perfectly. She found that 
about 13,200 years ago, charcoal accumu- 
lation jumped 30-fold and persisted at 
heightened levels for the next 400 years. 
That’s the lingering record of rampant wild- 
fires, which likely razed vast tracts of wood- 
lands and encouraged more fire-tolerant 
plants. Eventually, fire ushered in the cur- 
rent chaparral-dominated habitat. 

As with the 2020 Australian bushfires, 
which killed or displaced an estimated 
3 billion animals, these late Pleistocene 
wildfires probably killed many animals out- 
right. But the sudden loss of habitat and 
food may have been even more devastating,
Lindsey explains. Herbivore numbers
dwindled as their food sources burned, and 
carnivore populations followed suit. 

Why so many fires? Paleoclimate re- 
cords reveal the region had previously gone 
through similar warm and dry spells with- 
out such dramatic ecological consequences. 
“The difference at the end of the last ice 
age,” Lindsey says, “is humans are here.” 


THE OLDEST unequivocal evidence of human 
presence in California comes from a partial 
skeleton known as the Arlington Springs 
Man, found on Santa Rosa Island and dated 
to about 12,900 years ago—after the mega- 
fauna were gone. But several other sites in- 
cluding White Sands in New Mexico, Paisley 
Caves in Oregon, and Monte Verde in Chile 
all point to humans being in the Americas 
by between 21,000 and 16,000 years ago. 

There’s no good reason some wouldn’t 
have settled in coastal Southern California, 
Lindsey says. “Once humans show up some- 
where, we're everywhere,” she says. 

And where humans go, fire follows. “Fire 
is a great tool,” says Dunn, who studies how 
fire has shaped human and plant communi- 
ties. “Humans are very adept at using fire 
for land management ... to create mosaic 
habitats, to create the right kinds of mate- 
rials for basketry, for harvesting grasshop- 
pers, for hunting strategies.” 

In the hands of Indigenous Americans, 
cultural burning techniques—also known 
as fire-stick farming—actually prevented 
the fuel buildup that causes large, out-of- 
control wildfires, like those that followed 
the arrival of European settlers, says 
Watkins, who is a member of the Choc- 
taw Nation of Oklahoma. Europeans 
suppressed fires and introduced cattle 
ranching, allowing more fuel to build up 
and creating larger and larger combustible 
grasslands. In recent years, Indigenous 
communities and fire scientists have called 
for land managers to employ more of the 
time-tested cultural management tech- 
niques to prevent forest fires. 

“Fire-stick technology has been docu- 
mented as a means of controlling fuel loads 
and contributes to less severe events,” 
Watkins says. “But those technologies have 
been suppressed as population density 
has expanded.” 

There are few direct data points about
how early Americans used fire 13,000 years
ago, whether as simple campfires or to
manage ecosystems. And whatever
their fire practices, people in Southern
California may have coexisted with the local
megafauna for thousands of years. But
then something changed. Previously published
models of population growth based
on archaeological and genetic evidence
suggest a demographic explosion in North
America between about 15,000 and 13,000
years ago. If that held true for Southern
California—a big “if,” the researchers
admit—it means populations were booming
just as the Bølling-Allerød began and
the climate turned hotter and drier.

Liquid asphalt bubbles up from a La Brea tar pit known as No. 61/67, from which most of the animals radiocarbon dated for a new study were excavated.

The climate change would have made 
human-caused fires more likely to burn 
out of control, the researchers argue. The 
burning turned the landscape from wood- 
land to chaparral, dooming most of the 
region’s megafauna by 13,000 years ago. 
Mammoths and mastodons are rare in the 
La Brea record, but other sites 
show they hung on for another 
few thousand years, perhaps 
because they faced less pres- 
sure from human hunters and 
other predators. And coyotes 
adapted, as always, probably 
by switching to smaller prey, 
Dunn explains. 

Loren Davis, an archaeo- 
logist at Oregon State Univer- 
sity, largely agrees with the 
team’s conclusions. “People in 
the landscape can’t be blamed 
for all of the fires, but they’re 
probably making some of them,” 
he says. “And they’re making
them right at this tipping point 
of environmental change.” 

Watkins, though, stresses 
that humans would have been 
just one source of fire. “The climate
is already changing and local areas
drying ... dry lightning storms could very 
easily be responsible for the increase in 
wildfires,” he says. He adds that although 
the authors stress a constellation of fac- 
tors, he wishes they had more explicitly 
emphasized that humans living in the re- 
gion weren’t directly responsible for the 
megafauna extinction, to help head off 
simplistic interpretations and blame.

Others agree with the combination hypothesis.
Huw Groucutt, a prehistoric
archaeologist at the Max Planck Institute
of Geoanthropology, speculated in a 2021
paper that humans may have indirectly
contributed to the extinction of North
American megafauna, perhaps by intensive
hunting in ecosystems weakened by climate
change. “This new study supports [a version
of] that perspective,” he says. “It suggests
humans had an impact on megafauna, but
they did so in the context of ecosystems
made vulnerable by climate change.”

UCLA graduate student Lisa Martinez sorts through sediment from Lake Elsinore,
where she found a 30-fold increase in charcoal around the time of extinction.

Perri cautions that other regions, such as 
the Southwest or Florida, may have a differ- 
ent story. “We shouldn’t view this extinction 
as a single event that happened in the same 
way everywhere.” Groucutt agrees, noting 
that it’s difficult to know for sure 
whether the last appearances of 
species at La Brea match their 
disappearances in the greater 
region. “While it’s a great sam- 
ple of dates, it’s still a sample ... 
from one part of one single site.” 

O’Keefe and Lindsey say 
more data are on the way. But 
for them, the existing evidence 
is enough to spin a cautionary 
tale for people living in South- 
ern California and other fire- 
prone regions today. “As we say 
in the paper, all the precon- 
ditions—a warming climate, 
increasing human population— 
that caused that state transition 
back then are reoccurring today,” 
O'Keefe says. “If that state tran- 
sition happens again, where are 
we going to end up?”


Bacteria stretch and bend oil
to feed their appetite 


Microbes reshape oil droplets to speed biodegradation 


By Terry J. McGenity 
and Pierre Philippe Laissue 


It is imperative to understand the fate of
crude oil that escapes into the ocean to 
minimize its environmental, economic, 
and societal harm. Large amounts of 
crude oil enter the sea, as occurred this 
past month on a platform in the Gulf of 
Mexico. Oil does not easily mix with water, 
which can restrict oil degradation through 
microbes, a key pathway to remove hydro- 
carbons from the environment. However, 
turbulent seas and response measures, such 
as dispersant addition, generate smaller oil 
droplets that are attractive to voracious mi- 
crobial activity. On page 748 of this issue, 
Prasad et al. (1) report that bacteria attach 
to oil droplets, then grow as a film on the 
oil surface, sometimes reshaping spherical 
droplets into finger-like protrusions. This dy- 
namic process increases the oil’s surface area 
and accelerates its biodegradation. The finding
should improve predictions of spilled oil
transport to ecologically sensitive sites. 
Certain marine microbes use hydro- 
carbons as a carbon and energy source. 
Collectively, they can metabolize and 
thereby degrade crude oil. Alcanivorax bor- 
kumensis (or Alca) is a widespread hydro- 
carbon-degrading marine bacterium that is 
frequently dominant in seawater contam- 
inated with crude oil (2). Alkanes are the 
simplest family of hydrocarbons in crude 
oil. Prasad et al. used Alcanivorax (which 
translates to “devourer of alkanes”) and 
droplets of n-hexadecane—with diameters 
between 10 and 200 µm—as a model system
to examine how Alca forms a biofilm on a 
liquid surface, and how the interfacial oil- 
water properties affect oil degradation. 
Prasad et al. microscopically examined liv- 
ing cells of Alca on oil droplets in a micro- 
fluidic chamber and modeled the dynamic 
oil-seawater interface. Alca forms a biofilm, 
composed of cells in a polymer matrix, on 
the surface of oil droplets, which aids with
attachment (2-4). The authors observed
that the time in which Alca was cultivated 
on n-hexadecane, before being given a new 
supply of alkane droplets to colonize, deter- 
mined the shape of the growing biofilm and 
of these droplets. When transferred after 1 
day of culture, Alca proliferated and formed 
thick spherical biofilms on the surface of oil 
droplets. By contrast, when transferred af- 
ter 5 days of culture, Alca proliferated and 
frequently formed a biofilm with finger-like 
(dendritic) protrusions (see the image). 

The question addressed by Prasad et al. is 
how Alca changes the shape of oil droplets. 
The authors built on work (2-5) showing 
that Alca’s growth on hydrocarbons for a 
long time reduces the water-alkane interfa- 
cial tension and causes the cells to become 
more hydrophobic. These biochemical ad- 
aptations increase the strength of bacte- 
rial adhesion to the alkane, likely through 
biosurfactants, which are biomolecules with 
both hydrophobic and hydrophilic moieties. 
These changes, together with end-to-end 
cellular alignment and the stress exerted 
by cell division, buckled and stretched the 
alkane sphere while stabilizing the resul- 
tant alkane-containing protrusions, some 
of which detached. The biofilm-induced
expansion of the alkane’s surface area gave 
Alca better access to its carbon and energy 
source, resulting in a 3.5-fold increase in 
the rate of oil consumption in dendritic bio- 
films compared with spherical biofilms. 
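
The geometric point, that stretching a droplet exposes more oil surface at the same volume, can be checked with a short calculation. This is a minimal sketch under stated assumptions and not the model used by Prasad et al.: it treats a protrusion as an idealized spherocylinder (a cylinder with hemispherical caps), and the function names, the 100 µm droplet, and the 10:1 aspect ratio are hypothetical choices for illustration.

```python
import math


def sphere_area_from_volume(volume: float) -> float:
    """Surface area of a sphere that encloses the given volume."""
    radius = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)
    return 4.0 * math.pi * radius ** 2


def spherocylinder_area(volume: float, aspect_ratio: float) -> float:
    """Surface area of a cylinder with hemispherical caps that encloses the
    given volume. aspect_ratio = total length / diameter (1.0 is a sphere)."""
    if aspect_ratio < 1.0:
        raise ValueError("aspect_ratio must be >= 1")
    # volume = pi*r^2*L + (4/3)*pi*r^3, with cylinder length L = 2*r*(aspect_ratio - 1)
    shape_factor = 2.0 * (aspect_ratio - 1.0) + 4.0 / 3.0
    radius = (volume / (math.pi * shape_factor)) ** (1.0 / 3.0)
    cylinder_length = 2.0 * radius * (aspect_ratio - 1.0)
    return 2.0 * math.pi * radius * cylinder_length + 4.0 * math.pi * radius ** 2


if __name__ == "__main__":
    diameter_m = 100e-6  # assumed droplet size, inside the 10 to 200 µm range studied
    volume = math.pi * diameter_m ** 3 / 6.0
    gain = spherocylinder_area(volume, 10.0) / sphere_area_from_volume(volume)
    print(f"area gain for a 10:1 protrusion: {gain:.2f}x")
```

With these assumed shapes, a 10:1 spherocylinder exposes roughly 1.7 times the area of a sphere of equal volume, so more cells can sit at the oil-water interface at once.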

Prasad et al. also demonstrated that Alca
consumes approximately one cell’s volume
of alkane per hour, irrespective of the type
of biofilm formed. The larger calculated
rate of oil consumption by dendritic
biofilms compared with spherical biofilms
is therefore attributed to the presence of more
bacteria feeding simultaneously. Notably, the
authors developed a mechanistic understanding
of the process through live imaging to
follow temporal changes in biofilm formation
at the spatial resolution of a single
bacterial cell—close to 2 µm. The authors
further supported the findings by conducting
experiments guided by liquid-crystal
theories and simulations. For example, they
designed experiments that used microfluidics
to control and measure deformations of
an alkane droplet.

A film of Alcanivorax borkumensis bacteria (green)
adheres to oil droplets, creating protrusions filled
with oil that boost the rate of microbial consumption.
IMAGE: PRASAD ET AL. (1)

There are several practical implications 
from the study of Prasad et al. Data on the 
diameter of oil droplets improve estimates 
of the biodegradation rate by microbes. This 
can help refine the predictions of the fate 
and transport of oil at sea (6). Whether non- 
spherical oil-droplet morphologies should be 
accounted for in oil-spill fate-and-transport 
models will depend on how frequently and 
in which environmental conditions dendritic 
droplets form. A first step to address this will 
be to count different oil-droplet morpholo- 
gies after a spill at sea or in realistic exper- 
iments. These experiments should take into 
consideration permutations of assembled 
and natural microbial communities that are 
preconditioned in different ways and exposed 
to mixtures of hydrocarbons and crude oil. 

Spherical and dendritic biofilm-coated oil 
droplets are likely to differ in viscous drag 
and density, which affects their buoyancy 
and thus the time spent in the ocean’s water 
column, rather than on the surface, shore, or 
sea floor (7). This is notable because microbes 
on oil droplets will have greater access to nu- 
trients such as nitrogen, phosphorus, or iron 
when in the water column. These nutrients 
can be a limiting factor for biodegradation in 
the nutrient-depleted ocean but not in exper- 
iments that use nutrient-rich media, such as 
in the study by Prasad et al. Bacteria-coated 
oil droplets rise more slowly than noncoated 
droplets in a water column, and complex 
nonspherical morphologies, resembling the 
detached dendrites with low oil content seen 
by Prasad et al., tend to float or even sink 
(8). The practical implications of these ob- 
servations depend on whether oil droplets 
originate from dispersed deep-sea oil spills or 
from surface spills. 
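
How strongly droplet size and density control time in the water column can be illustrated with Stokes' law for a small rigid sphere. The sketch below is an assumption-laden estimate, not a calculation from the study or from reference 8: the densities and viscosity are approximate textbook values for n-hexadecane and seawater near 20°C, the function name is hypothetical, and the formula holds only at low Reynolds number, so the result for the largest droplets is a rough guide.

```python
G = 9.81  # gravitational acceleration, m/s^2

# Approximate values assumed for illustration (near 20 degrees C).
SEAWATER_DENSITY = 1025.0      # kg/m^3
HEXADECANE_DENSITY = 773.0     # kg/m^3
SEAWATER_VISCOSITY = 1.07e-3   # Pa*s


def stokes_rise_velocity(diameter_m: float,
                         droplet_density: float = HEXADECANE_DENSITY,
                         fluid_density: float = SEAWATER_DENSITY,
                         fluid_viscosity: float = SEAWATER_VISCOSITY) -> float:
    """Terminal velocity in m/s of a small rigid sphere (positive = rising),
    from Stokes' law: v = 2 * r^2 * (rho_fluid - rho_droplet) * g / (9 * mu)."""
    radius = diameter_m / 2.0
    return (2.0 * radius ** 2 * (fluid_density - droplet_density) * G
            / (9.0 * fluid_viscosity))


if __name__ == "__main__":
    for diameter_um in (10, 200):  # the size range used by Prasad et al.
        v = stokes_rise_velocity(diameter_um * 1e-6)
        print(f"{diameter_um} um droplet rises at about {v * 1000:.3f} mm/s")
    # Hypothetical coated droplet: a biofilm raises the effective density.
    coated = stokes_rise_velocity(200e-6, droplet_density=900.0)
    print(f"200 um coated droplet (assumed 900 kg/m^3): {coated * 1000:.3f} mm/s")
```

Because velocity scales with the square of the radius, the 200 µm droplet rises 400 times faster than the 10 µm one in this idealization, and any coating that narrows the density gap slows the rise further.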

Further examination of the oil-water in- 
terfacial properties of spherical and den- 
dritic biofilms could include proteomic and 
metabolite analyses to identify the amphiphilic
biomolecules responsible for the morphological
differences in biofilms. Improving
temporal resolution with advanced imaging 
could refine biofilm dynamic processes. For 
example, light-sheet microscopy enabled 
the tracing of individual bacterial cells in 
the Vibrio cholerae biofilm over 16 hours, 
revealing an unexpected fountain-like flow 
of cells being transported to the growing 
edge of the biofilm (9). 

Single-cell spatial transcriptomics (which 
provides a profile of gene expression) could 
address why some alkane droplets remained 
spherical whereas others became dendritic 
when colonized by Alca grown for 5 days. On 
the fifth day, Alca is in stationary phase, a 
period with no net increase in the number of 
bacteria and when cells are physiologically 
different from those in the early stage of 
growth. At stationary phase, the bacterium 
Pseudomonas aeruginosa has a greater vari- 
ability in gene expression compared with 
that in earlier phases when proliferation is 
rapid (10). This variation within the popula- 
tion is presumably a bet-hedging strategy in 
the face of an unpredictable environment—a 
tactic that may also be used by Alca. The sta- 
tionary phase also saw increased expression 
of a gene involved in the synthesis of rham- 
nolipid, a biosurfactant that may prepare the 
cell for biofilm formation, highlighting par- 
allels between P. aeruginosa (10) and Alca. 

Alca alone cannot degrade the thousands 
of hydrocarbons in crude oil. This requires 
a diverse community of microbes (11), interacting
with each other or sometimes competing
(12). The findings of Prasad et al. lay
important groundwork for examining more 
realistic scenarios. Translating from single- 
species microscale interactions with surfaces 
to macroscale multispecies processes will im- 
prove understanding of the mechanisms that 
drive the biodegradation and transport of oil 
spilled into the oceans.


Targeting
cancer with 
molecular glues 


Molecular glues suppress 
the active form of the 
oncogenic protein KRAS 


By Jun O. Liu


Mutations of KRAS are prevalent in
various cancers, but this oncogenic
protein (oncoprotein) was considered
undruggable owing to its relatively
flat surface that has no obvious
binding pockets for small
molecules. A breakthrough came when the
Cys12 mutation, KRAS(G12C), was astutely
exploited using a small-molecule drug that
covalently reacts with the thiol group of
cysteine (1). This led to the ensuing development
of two US Food and Drug
Administration (FDA)-approved KRAS(G12C)
inhibitors, sotorasib and adagrasib, which
have brought significant survival benefits to
patients with cancers that have the KRAS(G12C)
mutation (2, 3). These inhibitors selectively
target the guanosine diphosphate (GDP)-bound
inactive form of KRAS(G12C), which
makes them slow acting, and patients can
also develop drug resistance. On page 794 of
this issue, Schulze et al. (4) report an approach
that targets the active guanosine triphosphate
(GTP)-bound KRAS(G12C) with molecular
glues that recruit the cellular
chaperone cyclophilin A (CYPA) to block
KRAS(G12C)-induced oncogenic signaling.
CYPA was initially discovered as a high-affinity
receptor of the immunosuppressive
drug cyclosporin A (5). Subsequently, CYPA
was found to possess peptidyl prolyl cis-trans
isomerase activity that is involved in protein 
folding (6). Notably, the CYPA-cyclosporin A 
complex, but neither CYPA nor cyclosporin 
A alone, binds to and inhibits the protein 
phosphatase activity of calcineurin that is re- 
quired for calcium signaling in helper T cells. 
This revealed an unprecedented mode of ac- 
tion by a small molecule—working as molec- 
ular glue to bring two otherwise noninteract- 
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A film of Alcanivorax borkumensis bacteria (green) 
adheres to oil droplets, creating protrusions filled 
with oil that boost the rate of microbial consumption. 


of alkane per hour, irrespective of the type 
of biofilm formed. However, the larger cal- 
culated rate of oil consumption by dendritic 
biofilms compared with spherical biofilms 
is attributed to the presence of more bacte- 
ria feeding simultaneously. Notably, the au- 
thors developed a mechanistic understand- 
ing of the process through live imaging to 
follow temporal changes in biofilm forma- 
tion at the spatial resolution of a single 
bacterial cell—close to 2 um. The authors 
further supported the findings by conduct- 
ing experiments guided by liquid-crystal 
theories and simulations. For example, they 
designed experiments that used microfluid- 
ics to control and measure deformations of 
an alkane droplet. 

There are several practical implications 
from the study of Prasad et al. Data on the 
diameter of oil droplets improve estimates 
of the biodegradation rate by microbes. This 
can help refine the predictions of the fate 
and transport of oil at sea (6). Whether non- 
spherical oil-droplet morphologies should be 
accounted for in oil-spill fate-and-transport 
models will depend on how frequently and 
in which environmental conditions dendritic 
droplets form. A first step to address this will 
be to count different oil-droplet morpholo- 
gies after a spill at sea or in realistic exper- 
iments. These experiments should take into 
consideration permutations of assembled 
and natural microbial communities that are 
preconditioned in different ways and exposed 
to mixtures of hydrocarbons and crude oil. 

Spherical and dendritic biofilm-coated oil 
droplets are likely to differ in viscous drag 
and density, which affects their buoyancy 
and thus the time spent in the ocean’s water 
column, rather than on the surface, shore, or 
sea floor (7). This is notable because microbes 
on oil droplets will have greater access to nu- 
trients such as nitrogen, phosphorus, or iron 
when in the water column. These nutrients 
can be a limiting factor for biodegradation in 
the nutrient-depleted ocean but not in exper- 
iments that use nutrient-rich media, such as 
in the study by Prasad et al. Bacteria-coated 
oil droplets rise more slowly than noncoated 
droplets in a water column, and complex 
nonspherical morphologies, resembling the 
detached dendrites with low oil content seen 
by Prasad et al., tend to float or even sink 
(8). The practical implications of these ob- 
servations depend on whether oil droplets 
originate from dispersed deep-sea oil spills or 
from surface spills. 

Further examination of the oil-water in- 
terfacial properties of spherical and den- 
dritic biofilms could include proteomic and 
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metabolite analyses to identify the amphi- 
philic biomolecules responsible for the mor- 
phological differences in biofilms. Improving 
temporal resolution with advanced imaging 
could refine biofilm dynamic processes. For 
example, light-sheet microscopy enabled 
the tracing of individual bacterial cells in 
the Vibrio cholerae biofilm over 16 hours, 
revealing an unexpected fountain-like flow 
of cells being transported to the growing 
edge of the biofilm (9). 

Single-cell spatial transcriptomics (which 
provides a profile of gene expression) could 
address why some alkane droplets remained 
spherical whereas others became dendritic 
when colonized by Alca grown for 5 days. On 
the fifth day, Alca is in stationary phase, a 
period with no net increase in the number of 
bacteria and when cells are physiologically 
different from those in the early stage of 
growth. At stationary phase, the bacterium 
Pseudomonas aeruginosa has a greater vari- 
ability in gene expression compared with 
that in earlier phases when proliferation is 
rapid (10). This variation within the popula- 
tion is presumably a bet-hedging strategy in 
the face of an unpredictable environment—a 
tactic that may also be used by Alca. The sta- 
tionary phase also saw increased expression 
of a gene involved in the synthesis of rham- 
nolipid, a biosurfactant that may prepare the 
cell for biofilm formation, highlighting par- 
allels between P. aeruginosa (10) and Alca. 

Alca alone cannot degrade the thousands 
of hydrocarbons in crude oil. This requires 
a diverse community of microbes (11), interacting
with each other or sometimes competing
(12). The findings of Prasad et al. lay
important groundwork for examining more 
realistic scenarios. Translating from single- 
species microscale interactions with surfaces 
to macroscale multispecies processes will im- 
prove understanding of the mechanisms that 
drive the biodegradation and transport of oil 
spilled into the oceans.

Molecular glues suppress
the active form of the 
oncogenic protein KRAS 


By Jun O. Liu


Mutations of KRAS are prevalent in
various cancers, but this oncogenic
protein (oncoprotein) was considered
undruggable owing to its relatively
flat surface that has no obvious
binding pockets for small
molecules. A breakthrough came when the
Cys12 mutation, KRAS(G12C), was astutely
exploited using a small-molecule drug that
covalently reacts with the thiol group of
cysteine (1). This led to the ensuing development
of two US Food and Drug
Administration (FDA)-approved KRAS(G12C)
inhibitors, sotorasib and adagrasib, which
have brought significant survival benefits to
patients with cancers that have the KRAS(G12C)
mutation (2, 3). These inhibitors selectively
target the guanosine diphosphate (GDP)-bound
inactive form of KRAS(G12C), which
makes them slow acting, and patients can
also develop drug resistance. On page 794 of
this issue, Schulze et al. (4) report an approach
that targets the active guanosine triphosphate
(GTP)-bound KRAS(G12C) with molecular
glues that recruit the cellular
chaperone cyclophilin A (CYPA) to block
KRAS(G12C)-induced oncogenic signaling.

CYPA was initially discovered as a high- 
affinity receptor of the immunosuppressive 
drug cyclosporin A (5). Subsequently, CYPA 
was found to possess peptidyl prolyl cis-trans
isomerase activity that is involved in protein 
folding (6). Notably, the CYPA-cyclosporin A 
complex, but neither CYPA nor cyclosporin 
A alone, binds to and inhibits the protein 
phosphatase activity of calcineurin that is re- 
quired for calcium signaling in helper T cells. 
This revealed an unprecedented mode of action
by a small molecule: working as molecular
glue to bring two otherwise noninteracting
proteins together to manifest its activity
(7). Aside from cyclosporin A, two other macrocyclic
immunosuppressive drugs, FK506
and rapamycin, were also found to act as molecular
glues (7, 8). A search for cyclosporin
A-like natural products led to the discovery
of sanglifehrin A, which also binds to CYPA
(9, 10). A high-resolution crystal structure of
the CYPA-sanglifehrin A complex revealed
the presence of a minimal tripeptide motif in
sanglifehrin A that mediates its interaction
with CYPA (11).

Inhibition of a KRAS mutant with molecular glue

KRAS induces cell proliferation by exchanging GDP (inactive state) with GTP (active state). This leads to its association with and activation of downstream signaling proteins, such as RAF and PI3K. The Gly12-to-Cys12 mutation of KRAS impairs its ability to hydrolyze GTP to GDP, causing excessive cell proliferation and cancer. Sotorasib and adagrasib irreversibly inhibit KRAS(G12C)-GDP, whereas the molecular glue RMC-4998 selectively targets KRAS(G12C)-GTP by forming a ternary complex with the prolyl isomerase CYPA.
CYPA, cyclophilin A; GDP, guanosine diphosphate; GTP, guanosine triphosphate; PI3K, phosphatidylinositol-3 kinase.

Schulze et al. began their search for an inhibitor
of KRAS(G12C)-GTP by tethering a cysteine-reacting
moiety to the tripeptide CYPA-binding
motif of sanglifehrin A. The resultant
tool compound induced the formation of a
ternary complex between CYPA and KRAS(G12C)
bound to a GTP analog (GMPPNP) and covalently
modified Cys12. Based on a high-resolution
crystal structure of the complex, Schulze
et al. optimized the tool compound through
iterative structure-guided design, syntheses,
and evaluation of new analogs, which
eventually yielded a lead compound called
RMC-4998 that exhibited high potency and
selectivity toward KRAS(G12C)-GTP, blocking its
signaling activity (see the figure). This paved
the way to the subsequent development of a
drug candidate, RMC-6291, that is now undergoing
testing in a phase I clinical trial
of patients with advanced KRAS(G12C)-mutant
cancers (NCT05462717).

The ability of RMC-4998 to induce the for- 
mation of the ternary complex was attributed 
in part to the neomorphic surface of CYPA 
that is induced by RMC-4998 binding and in- 
volves at least four residues from CYPA: Asn”, 
Lys®, Arg™*, and Trp™. It is remarkable that 
CYPA, which does not interact with KRAS(G12C),
is capable of forging such extensive surface
interactions with KRAS(G12C) upon binding
to RMC-4998. By comparison, seven CYPA
residues directly interact with calcineurin in 
the CYPA-cyclosporin A-calcineurin ternary 
complex, of which only Arg™® is also used in 
CYPA-RMC-4998-KRAS(G12C)-GTP (12). That
CYPA is able to forge direct interactions with 
different proteins in a ligand-dependent fash- 
ion, including with calcineurin (through cy- 
closporin A), inosine monophosphate dehy- 
drogenase-2 (through sanglifehrin A) (9), and 
now GTP-bound KRAS(G12C) (through RMC-4998),
suggests that the surface of CYPA may
have an unusual ability to adapt to a new in- 
teracting surface environment. 

As a chaperone, CYPA may have a 
greater intrinsic plasticity and adaptability 
because, as a prolyl isomerase, it interacts 
with diverse protein substrates. Similar 
surface plasticity and adaptability is seen 
in FK506-binding protein 1A (FKBP1A), 
another prolyl isomerase that can mediate 
the formation of ternary complexes with 
calcineurin (through FK506), mechanis- 
tic target of rapamycin (mTOR, through 
rapamycin), the centrosome-associated 
protein CEP250 (through WDB002), and
equilibrative nucleoside transporter 1 
(ENT1, through rapadocin) (7, 8, 13, 14). 
The surface plasticity and adaptability of 
both CYPA and FKBP1A should bode well
for future discovery and design of molecu- 
lar glues for other targets. 

The CYPA-binding molecular glues have
several advantages over conventional small-molecule
inhibitors of KRAS(G12C) such as sotorasib
and adagrasib. The formation of a
more-extensive composite surface of the
CYPA-RMC-4998 complex made it possible
to selectively target KRAS(G12C)-GTP, making
RMC-4998 insensitive to growth factor
stimulation that increases the concentration
of KRAS(G12C)-GTP and desensitizes cells to
sotorasib and adagrasib. The presence of CYPA
in the ternary complex occluded a space on
top of KRAS(G12C), preventing the binding of
known downstream effector proteins, including
RAF and phosphatidylinositol-3 kinase
(PI3K). Moreover, the multiplicity of interactions
of KRAS(G12C)-GTP with both RMC-4998
and surface residues of CYPA makes it
more difficult for drug-resistant mutants in
KRAS or CYPA to emerge. Indeed, Schulze et
al. showed that mutation of a CYPA residue
involved in the interaction with KRAS(G12C)
incompletely ablated CYPA-RMC-4998-KRAS(G12C)
complex formation. It will be interesting
to see whether RMC-6291 is superior to
the existing KRAS(G12C)-GDP inhibitors,
with greater efficacy and more-durable
responses in patients.

The success of the development of RMC- 
4998 and RMC-6291 epitomizes the power 
of combining structural biology, rational de- 
sign, and medicinal chemistry in molecular- 
glue discovery. This bottom-up approach is in 
contrast to the alternative and complemen- 
tary forward chemical-genetics approach in 
which a library of structurally diverse, glue- 
like molecules is generated for both pheno- 
typic and target-based screens (14). A combi- 
nation of both approaches is likely to further 
accelerate discoveries of new molecular glues 
(15). With the seemingly untouchable active
states of KRAS(G12C) conquered, it will be interesting
to explore whether these approaches
can be applied to other KRAS mutants, small
GTPases, trimeric G proteins, and even other
classes of “undruggable” targets.

Extracting resources from abandoned mines


Recovering minerals and metals from abandoned mines could aid decarbonization 


By Zhongwen Bao, Carol J. Ptacek, 
and David W. Blowes 


The development of environmentally
friendly technologies (e.g., electric
vehicles, wind turbines, solar panels,
and lithium-ion batteries) to achieve
a low-carbon future will require large
quantities of critical minerals and
metals (1). Mining and extraction of these
materials are anticipated to generate sub- 
stantially larger volumes of mine wastes, 
including waste rock and tailings (2). The 
release of contaminated water from mine 
wastes can cause long-term environmental 
damage and land degradation, posing chal- 
lenges for pollution control and environ- 
mental remediation. Resource extraction 
from contaminated water, mine wastes, 
and mine workings (e.g., unexploited open 
pits and underground tunnels) at aban- 
doned mines could potentially address the 
increased demands of critical minerals 
and metals for decarbonization. However, 
careful planning is required to ensure that 
negative environmental effects are not ex- 
acerbated during resource recovery at aban- 
doned mines. 

Mining activities can have long-term pos- 
itive impacts by supplying valuable miner- 
als and metals to advance economic devel- 
opment, energy transition, infrastructure 
construction, and technological innovation. 
However, mining has also caused environ- 
mental and societal harm. Vast quantities 
of potentially chemically reactive wastes 
are accumulated over large land footprints 
at mine sites. Globally, land that has been 
affected by mining amounts to 66,000 
km², including waste-rock piles, tailings
impoundments, and open pits as well as 
mining and processing infrastructure (3). 
Much of this mining-affected land includes 
abandoned mines, which are neither in op- 
eration nor managed. Mines are abandoned 
for a variety of reasons, including depletion 
of ores, commodity price fluctuations, and 
technical challenges associated with mining 
deeper ores. The global area of land use and 
contamination level at abandoned mines 
are unknown. Yet, there is a distinct contrast
between the number of active mines
and the much larger number of abandoned
mines. For example, in Canada, there are 
~200 active mines compared with more 
than 10,000 abandoned mines. 

Each abandoned mine has distinct attri- 
butes, including the physicochemical and 
geotechnical characteristics of mine-waste 
deposits and mine voids. These characteris- 
tics strongly influence the impacts to land, 
water, and ecosystems, including releases of 
water contaminated with high concentra- 
tions of toxic metals and metalloids [e.g., 
acid mine drainage (AMD)] and releases 
of carbon dioxide (CO2). Chronic releases
of AMD and catastrophic failures of tail- 
ings dams lead to “dead zones” at mine 
sites, which exhibit biodiversity loss, land 
deterioration, and degraded surface water 
and groundwater and can be detrimental to 
human health (4). For instance, in 1999, an 
extremely low pH of -3.6 was reported in 
AMD from the Richmond Mine of the Iron 
Mountain copper deposit, USA, which was 
mined in the early 1900s (5). This AMD re- 
sulted in fish death and vegetation denuda- 
tion. Additionally, release of dissolved con- 
taminants and alteration of groundwater 
and surface water flow systems can harm 
aquatic ecosystems, such as effects on sal- 
monids (ray-finned fish) and their habitat 
in northwestern North America as a result 
of tailings dam failures, such as that at 
Mount Polley Mine, Canada, in 2014 (6). 
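
Because pH is the negative base-10 logarithm of hydrogen ion activity, a negative pH implies an activity greater than 1. The conversion can be sketched as follows; the helper name is hypothetical, neutral water at pH 7 is used only as an assumed point of comparison, and at such extreme acidity activity differs substantially from concentration, so the result is indicative rather than a measured concentration.

```python
def hydrogen_ion_activity(ph: float) -> float:
    """Return the hydrogen ion activity implied by a pH value (pH = -log10 a_H+)."""
    return 10.0 ** (-ph)


richmond = hydrogen_ion_activity(-3.6)  # pH reported for AMD at the Richmond Mine
neutral = hydrogen_ion_activity(7.0)    # assumed reference: neutral water

print(f"activity at pH -3.6: {richmond:.0f}")
print(f"ratio to neutral water: {richmond / neutral:.2e}")
```

The implied activity is about 4,000, roughly 4 x 10^10 times that of neutral water.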

Although the stored mass of mine wastes 
is uncertain, an estimated global mass 
of tailings totals 223 billion metric tons 
generated from 1771 to 2019, whereas the 
total mass of waste rock is estimated to 
be about 10 times that of the mass of tail- 
ings (7). Exposure of sulfide-bearing mine 
wastes to water and oxygen triggers com- 
plex microbial-catalyzed biogeochemical 
reactions that generate acidic, neutral, and 
saline mine drainage. Mine wastes prone to 
AMD frequently cause the most severe wa- 
ter quality problems that persist for thou- 
sands of years (5), and thus AMD has been 
the focus of intense research and environ- 
mental remediation. However, neutral and 
saline mine drainage also have the poten- 
tial to cause environmental degradation. 
Selective mining, segregation and encapsu- 
lation, subaqueous disposal, lime or lime- 
stone addition, cover construction to limit 
water infiltration or oxygen ingress, and 
passive techniques (including constructed
wetlands, bioreactors, and permeable reactive
barriers) have been widely applied
to facilitate AMD mitigation either at the 
source or along the migration pathway (8). 
Typically, a combination of these techniques 
is implemented. 
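
For a sense of scale, the mass estimates given at the start of this paragraph can be combined into a rough global total. This is back-of-the-envelope arithmetic only: the variable and function names are hypothetical, and the factor of 10 is the approximate ratio stated in the text, so the result carries at least as much uncertainty as its inputs.

```python
TAILINGS_GT = 223.0       # billion metric tons of tailings, 1771 to 2019 (from the text)
WASTE_ROCK_FACTOR = 10.0  # waste rock is "about 10 times" the tailings mass (from the text)


def estimate_mine_waste(tailings_gt=TAILINGS_GT, factor=WASTE_ROCK_FACTOR):
    """Return rough tailings, waste-rock, and total masses in billion metric tons."""
    waste_rock_gt = tailings_gt * factor
    return {
        "tailings_gt": tailings_gt,
        "waste_rock_gt": waste_rock_gt,
        "total_gt": tailings_gt + waste_rock_gt,
    }


estimate = estimate_mine_waste()
print(estimate)
```

On these figures, waste rock would amount to roughly 2.2 trillion metric tons, and tailings plus waste rock to roughly 2.5 trillion metric tons.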

These AMD mitigation strategies may 
demonstrate short-term success; however, 
their long-term performance remains un- 
certain under changing climate scenarios. 
For example, subaqueous disposal of tail- 
ings into inland water bodies and oceans 
limits AMD generation by reducing the 
oxygen supply. It was found that ocean 
acidification and the present temperature 
increase caused by climate change can en- 
hance metal leaching from tailings disposed 
below a deep-water cover, potentially de- 
creasing water quality and causing negative 
environmental effects (9). Without perma- 
nent remediation, the long-term impacts of 
mine wastes on terrestrial and marine eco- 
systems will persist. 

Supplies from active mines cannot cur- 
rently meet the increased demands for criti- 
cal minerals and metals for decarbonization 
measures. Individual mines are operated to 
recover targeted commodities, which are 
determined by the economics of mining, 
processing, and global availability. Mine 
wastes at abandoned mines may contain 
recoverable concentrations of rare earth 
elements (REEs), platinum group elements, 
and metals—e.g., cobalt, copper, lithium, 
and nickel—that are more accessible than 
the low-grade ores that are currently mined 
(10). Many of these elements are indispens- 
able in technology development to achieve 
carbon-zero goals. Therefore, AMD, mine 
wastes, and mine workings at abandoned 
mines are being evaluated as potential 
sources for these critical minerals and met- 
als. Integration of resource exploration and 
recovery with environmental remediation 
provides a promising opportunity to gain 
value from mine wastes while tackling the 
environmental challenges associated with 
abandoned mines. 

Resource recovery and environmental remediation at abandoned mines

Minerals and metals that are in high demand could be recovered from abandoned mine wastes (tailings and waste rock) through heap leaching, which could also mitigate environmental contamination. Combining this approach with environmental remediation of any residual mine waste, such as with multilayer vegetated soil covers [including a geosynthetic clay liner (GCL)], could achieve long-term mitigation of the impacts of abandoned mines. However, potential pollution from heap leaching should also be addressed.

Assessing the resource potential of critical
minerals and metals in AMD, mine
wastes, and mine workings at abandoned
mines is the first step. Abandoned mines
are often located in remote locations with
sensitive environments. Appropriate remediation
designs, characterization studies,
and monitoring programs are needed to assess
the spatial distribution, bioavailability,
and migration pathways of contaminants. 
Many national inventories of abandoned 
mines with different risk categories and 
national registries have been established. 
For example, national mine-waste regis- 
tries in France, Hungary, Italy, Portugal, 
Slovenia, Spain, and the UK were created 
with basic information that can be used 
for evaluating potential resource recovery 
(11). Resource potentials of other critical 
minerals and metals (particularly REEs) in 
mine wastes should be jointly assessed in 
conjunction with site-specific characteriza- 
tion and monitoring programs to enhance 
knowledge of the behavior of critical miner- 
als and metals as well as their geochemical 
interactions. These efforts will help fill
the knowledge gap in national mine-waste
registries and facilitate decision-making to 
shift abandoned mines from a source of pol- 
lution to a resource for mineral recovery. 

Advances in exploration and recovery 
techniques of critical minerals and metals 
from different waste streams are promis- 
ing. In Europe, underwater robots, al- 
though still at the prototype stage, were 
successfully demonstrated for exploration 
of critical minerals and metals at flooded 
abandoned mines without dewatering costs 
and groundwater impacts (12). Traditional
AMD treatment through neutralization
using lime or limestone generates large
quantities of sludges that are rich in iron
oxyhydroxides, REEs, and other valuable
minerals. High-grade aluminum, REEs,
cobalt, and manganese were recovered
from sludges during AMD treatment using
neutralization reagents through a three-staged,
pH-dependent precipitation and
crystallization process (13).

Heap leaching is a method for economical
recovery from low-grade ores for the extraction
of, for example, copper, gold, and
uranium (14). This method requires the
application of leaching solutions (such as 
sodium cyanide for gold) through ore-bear- 
ing heap pads. Leaching solutions dissolve 
targeted minerals and metals into pregnant 
leach solution ponds and are subsequently 
recovered; the barren solutions can then be 
mixed with fresh reagents and recirculated 
into the heap-leaching pads. Given that 
mine wastes might have higher amounts 
of critical minerals and metals compared 
with the low-grade ores that are currently 
mined, heap leaching could function as a 
feasible technique for resource recovery at 
abandoned mines (10, 15). A combination 
of resource recovery through heap leaching 
with implementation of environmental re- 
mediation (e.g., a multilayer vegetated soil 
cover) may serve as a potentially sustainable
solution for mine-waste management
and pollution control (see the figure). 

Caution is needed to prevent the intro- 
duction of new environmental pollution and 
harm during resource recovery from mine 
wastes. For instance, a drainage protection 
layer should be implemented to prevent the 
infiltration of leach solutions into underly- 
ing soils and groundwater to avoid the long- 
term retention of hazardous materials in 
the subsurface. Residual solid wastes after 
heap leaching need further characterization 
for proper disposal through environmental 
remediation. Notably, industrial applica- 
tions of heap leaching are mostly used for 
the recovery of copper, gold, and uranium 
from low-grade ores, whereas heap leaching 
for resource recovery from mine wastes is 
still under development. Further research is
needed to evaluate the potential impacts of 
the commercialization of these exploration 
and recovery techniques through thorough 
monitoring and evaluation programs in lab- 
oratory-, pilot-, and industrial-scale appli- 
cations and to make the extraction of criti- 
cal minerals and metals from these waste 
streams environmentally friendly, socially 
acceptable, and economically feasible.

CELL BIOLOGY


What is a cell type? 


A next step for cell atlases should be to chart 
perturbations in human model systems 


By Jonas Simon Fleck, J. Gray Camp,
Barbara Treutlein


International efforts are underway to
provide a comprehensive survey of
cells across the human life span (1, 2).
These efforts assign cell “types” on the
basis of a set of information-rich molecular
features (the cell phenotype),
which include portions of the transcrip- 
tome, epigenome, and proteome, as well 
as the developmental lineage (3, 4). These 
features can be quantified in single cells 
in suspension after tissue dissociation or 
within intact tissue. The resulting cell at- 
lases provide insight into the organization, 
ontogeny, and evolution of human tissues. 
They also help to explore disease suscepti- 
bilities, navigate therapy development, and 
benchmark cell and tissue engineering. 
However, the atlas data can often identify 
phenotypic diversity (referred to as differ- 
ent cell states) among cells of the same 
type, raising the following question: What 
is a human cell type? 

One view is that a cell type or state not 
only involves the developmental history 
of the cell and its current set of molecu- 
lar features but is also linked to how the 
cell would respond to an environmental 
change or other perturbation. A perturbation
is any input that alters the phenotype
of a cell. Perturbations can be naturally occurring or
experimental and include genetic change; 
chemical, physical, or electrical stimulus; 
and pathogen infection. The complexity 
of perturbation-induced changes can vary 
substantially: Certain genetic loss-of-func- 
tion mutations affect a single pathway, and 
others can alter many cellular processes. 
Emerging experimental and computa- 
tional techniques could enable the sys- 
tematic profiling of perturbation-induced 
phenotypic changes at large scale, ideally 
with spatial resolution. This would enable 
human cell phenoscapes—the landscape of 
all possible cell phenotypes—to be charted. 
The totality of phenotypes under all possible
perturbations describes a local phenoscape
for each individual cell type.

Existing cell atlases describe current features,
and sometimes the developmental history,
of a cell but cannot provide information
on the factors that dictate development or 
predict future cell states. In addition, how 
to transform cells from one state to another 
and how to generate a defined cell state in 
vitro are still not known. Although in silico 
perturbation methods have been proposed 
(5), they are, at present, difficult to validate 
systematically because experimental data 
are scarce. If it became possible to predict 
the response of a cell from its current state, 
the existing cell atlases would be a rich re- 
source for in silico screening. 

Some perturbations alter a cell’s molecu- 
lar state in a reversible manner: The cell 
responds to the stimulus by transitioning 
from a stable state to a new, unstable state 
and returns once the stimulus is removed. 
Examples of such perturbations include 
transient exposure to signaling inhibitors 
or targeted repression of a gene’s expres- 
sion. By contrast, some perturbations can 
irreversibly change the type of the cell 
by inducing a new stable state. Examples 
include cell states that are engineered 
through transcription factor conversion of 
one cell type (e.g., fibroblast or pluripotent 
stem cell) into another (e.g., neuron) or 
created through the acquisition of genetic 
changes that convert from a normal to a 
cancerous cell type. In such cases, the new 
cell state is maintained after the original 
perturbation occurs. Rather than changing 
the current state of the cell, certain per- 
turbations can alter future cell develop- 
ment. For example, genetic or epigenetic 
changes can influence cell differentiation 
paths in later ontogenetic stages and lead 
to enrichment or depletion in different de- 
velopmental lineages (6, 7). The effects of 
this type of perturbation are highly depen- 
dent on the developmental stage in which 
it is induced. These perturbations can be 
viewed as altering or eroding the devel- 
opmental landscape such that certain ter- 
minal states are excluded, whereas others 
may become accessible. 

Reference atlases of healthy and dis- 
eased autopsy or biopsy tissues from indi- 
viduals with different genetic backgrounds 
will be vital to elucidate parts of the phe- 
noscape by cataloging naturally occurring 
base states and perturbations thereof (see 


associations between perturbation and 
cell phenotype are usually correlative. 
Furthermore, the number of possible per- 
turbations is vast. 

To address these limitations, perturba- 
tions need to be induced and cell responses 
characterized under controlled experimen- 
tal conditions in high throughput, which 
requires the use of in vitro model systems. 
However, studying the effects of perturba- 
tions at scale has trade-offs: Single-cell se- 
quencing of massive cell numbers in suffi- 
cient detail can be extremely expensive, and 
delivering many perturbations in parallel 
can be difficult to achieve. Proposed com- 
putational strategies suggest that the full 
state of a modality (e.g., the transcriptome 
or the epigenome) can be reconstructed 
using relatively few measurements (8). In 
addition, industrial engineering and auto- 
mation solutions are on the horizon to im- 
prove throughput and lower costs. There 
are also genetic (9) and chemical toolkits 
available that make it increasingly feasible 
to induce and characterize many different 
perturbations in high-throughput screens. 
Testing the effects of a large number of 
perturbations requires sampling many 
cells to retain statistical power. Such high- 
throughput sampling might be achieved 
through combinatorial barcoding of disso- 
ciated cells or image-based in situ readouts 
(3, 10). 

At present, high-throughput induction 
of perturbations can be used to elucidate 
the effects of drugs on cell phenotypes in 
relatively simple two-dimensional (2D) 
monocultures (11). However, human physiology
is complex, and tissues are composed
of many different cell types and states that 
dynamically interact. Probing the effects 
of genetic and chemical perturbations on 
a single cell type in isolation has limited 
predictive value. Context matters, and per- 
turbation response is dependent on the 
particular combination of cell states that 
is present in any multicellular system. 
Therefore, perturbations to a given cell 
state should ideally be studied within its 
tissue microenvironment. 

Human 3D cell-culture model systems 
(such as organoids) can recapitulate some 
cell states and microenvironmental inter- 
actions that are observed in their primary 
tissue counterparts. Protocols have been 
developed to generate human stem cell- 
derived tissues that recapitulate various 
physiological features of many different 
organ systems (12). For example, brain organoids
mirror progenitor and neuronal
cell states and show similar cytoarchitecture,
enabling the study of neurogenesis
in multiple brain regions (13). Intestinal
organoids contain stem, absorptive, and 
secretory cell types (12). Because organoids 
can be grown under controlled and scal- 
able culture conditions, they can be sub- 
jected to diverse perturbations and their 
response can be profiled using high-infor- 
mation content methods. Nevertheless, all 
human model systems are limited in their 
capacity to recapitulate the full complexity 
of human organs and interorgan interac- 
tions. In particular, most organoid systems 
are derived from a single germ layer and 
therefore often lack important tissue-sup- 
porting cell types, such as vasculature or 
immune compartments. Further optimiza- 
tion of organoid systems is expected to en- 
hance their predictive value. 

Another limitation of existing cell atlases 
is that cell-type annotations mostly rely on 
single-cell transcriptional or epigenomic 
profiles. New methods are emerging that 
provide information on the proteome, me- 
tabolome, lipidome, and other molecular 
features of individual cells from dissoci- 
ated tissue and in situ. This enhanced 
characterization, along with computa- 
tional approaches to integrate the data, 
will provide more nuanced distinctions 
of cell types and states. Clarifying what 
functionally constitutes a cell type might
also be aided by the proposed perturbation
atlases. Perturbing one set of cellular
features (e.g., chromatin organization) and 
profiling another (e.g., proteome) can help 
establish causal links between different 
features and illuminate how their interac- 
tion shapes complex emergent phenotypes. 
Consequently, the perturbation landscape 
could help distinguish cells based on func- 
tion rather than observed features, which 
might ultimately contribute to a more ho- 
listic notion of cell type. 

Unlike tissue atlases that catalog a finite 
set of cell states, phenoscape atlases could 
cover an arguably infinite set of possible 
single and combinatorial perturbations. It 
is unclear under what conditions such an 
atlas can ever be considered “complete,” 
making it crucial to learn to extrapolate 
from observed perturbations to unseen 
data. It might be that each cell type has a 
finite set of adjacent states that are pos- 
sible to reach or that different perturba- 
tions lead to convergent effects across cell 
types. Such redundancy can be revealed 
by mapping phenoscapes across cell types 
and tissues, and it can be leveraged by 
training machine- and deep-learning mod- 
els to yield a compressed representation of 
perturbation effects. A number of models 
have already been developed to learn links
between perturbations and transcriptomic
changes from training data and to predict
responses of unseen cell states, new combinations
of tested perturbations, or even
unseen perturbations (14). In addition to
predictive power, it is also a goal to make
models interpretable and to identify which
aspects of the inputted data enabled the
predictions, with the aim of gaining mechanistic
insight or identifying drug targets.
However, the accuracy of predictions and
the ability of these models to extrapolate
to unseen contexts is highly dependent on
the quality, quantity, and diversity of training
data. As a consequence, development
and evaluation of such models will greatly
benefit from large, systematically collected
datasets on the effects of perturbation in
different types of human multicellular
model systems.

Charting cell phenoscapes

Single cells in primary-tissue samples can be analyzed using multimodal phenotyping methods to provide information on the effect of perturbations on cell features. Complex three-dimensional human cell-culture models recapitulate aspects of human physiology and are amenable to high-throughput perturbation experiments followed by multimodal phenotyping. The totality of the human-cell phenotypic landscape (“phenoscape”) can be simplified and visualized as a topological and spherical map, where the totipotent cells at the core differentiate toward the surface topology.

Ultimately, predictive modeling of pertur- 
bations may provide a way to establish when 
this seemingly boundless phenoscape has 
been sufficiently charted. With appropriate 
models, an expanding phenoscape atlas will 
lead to reduced uncertainty and enhanced 
accuracy for predictions of the effects of 
unseen perturbations. Therefore, “complete- 
ness,” at least within a specific context, would 
be approached if the effects of unseen per- 
turbations can be accurately predicted and 
gathering further data results in marginal 
improvements in prediction accuracy. Model 
uncertainty can then serve as a valuable 
compass for informing future study designs, 
with the aim of systematically mapping unex- 
plored regions of the phenoscape. 
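
One way to make this stopping criterion concrete is sketched below in Python. It is a minimal illustration under stated assumptions, not a method described in the text: the held-out error values, both tolerances, and the function names `marginal_gains` and `is_sufficiently_charted` are hypothetical. Given the prediction error on unseen perturbations measured after each expansion of the atlas, the check reports saturation only when the latest error is low and the most recent expansions brought only marginal improvements.

```python
def marginal_gains(errors):
    """Error reduction achieved by each successive expansion of the atlas."""
    return [previous - current for previous, current in zip(errors, errors[1:])]


def is_sufficiently_charted(errors, error_tolerance, gain_tolerance, window=2):
    """Return True when held-out error is low and recent gains are marginal.

    errors: held-out prediction errors, one per atlas size, oldest first.
    error_tolerance: largest acceptable error on unseen perturbations.
    gain_tolerance: an improvement smaller than this counts as marginal.
    window: number of most recent expansions that must all be marginal.
    """
    if len(errors) <= window:
        return False  # too few expansions to judge saturation
    recent = marginal_gains(errors)[-window:]
    accurate = errors[-1] <= error_tolerance
    saturated = all(gain < gain_tolerance for gain in recent)
    return accurate and saturated


# Hypothetical held-out errors after six successive expansions of an atlas.
heldout_errors = [0.42, 0.30, 0.21, 0.16, 0.15, 0.148]

charted_earlier = is_sufficiently_charted(heldout_errors[:4], 0.2, 0.02)
charted_now = is_sufficiently_charted(heldout_errors, 0.2, 0.02)
```

In this toy series the first four expansions still yield sizable gains, so the check fails; only after the last two expansions, which barely change the error, does it pass. In practice the tolerances would have to be chosen per context, and model uncertainty estimates could replace the raw error values.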
ENVIRONMENTAL POLICY


Create a culture of experiments 
in environmental programs 


Organizations need a better “learning by doing” approach 
An understanding of cause and effect
is central to the design of effective
environmental policies and programs.
But environmental scientists
and practitioners typically rely on
field experience, case studies, and
retrospective evaluations of programs that 
were not designed to generate evidence 
about cause and effect. Using such methods 
can lead to ineffective or even counterpro- 
ductive programs. To help strengthen infer- 
ences about cause and effect, environmen- 
tal organizations could rely more on formal 
experimentation within their programs, 
which would leverage the power of science 
while maintaining a “learning by doing” ap- 
proach. Although formal experimentation is 
a cornerstone of science and is increasingly 
embedded in nonenvironmental social pro- 
grams, it is virtually absent in environmen- 
tal programs. We highlight key obstacles to 
such experimentation and suggest opportu- 
nities to overcome them. 

By “formal experimentation,” we mean
the deliberate creation of spatial or tempo- 
ral variation in program implementation 
with the intent of quantifying impacts and 
elucidating mechanisms. For example, con- 
sider an environmental agency that wants 
to learn how best to encourage polluters 
to comply with environmental regulations. 
Instead of implementing a single change 
in auditing practices across all polluting 
facilities, the agency could randomly vary 


implementation of two auditing practices 
and contrast how facilities respond (see 
the figure) [for an analogous real-world 
example, see (1)]. By creating deliberate
variation in how programs are implemented,
program administrators can more
easily learn about the features that make 
programs effective. Although experimenta- 
tion in natural resource management has 
a long history, including in the context of 
adaptive management, we focus on embed- 
ding experiments in the implementation of 
policies or programs that affect human be- 
havior. For example, in a not-atypical type 
of environmental policy experiment that 
tests whether thinning a reforested plot 
leads to more harvestable timber, human 
behavior is controlled by the experimental- 
ist, whereas in a much less common type 
of experiment that tests alternative design 
features of a program that encourages more 
reforestation behavior, human behavior is 
endogenous and uncertain. 
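
The auditing example above can be sketched in a few lines of Python. This is a hedged illustration only, not the design of any actual agency program: the facility count, the simulated emissions outcomes, the assumed effect size, and the function names are all hypothetical. Facilities are randomly split into two arms (on-site inspections versus self audits), the arms are contrasted with a difference in means, and a permutation test gauges whether the contrast could plausibly arise from chance alone.

```python
import random
import statistics


def assign_arms(unit_ids, rng):
    """Randomly split units into two arms of (nearly) equal size."""
    shuffled = list(unit_ids)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return set(shuffled[:half]), set(shuffled[half:])


def difference_in_means(outcomes, arm_a, arm_b):
    """Mean outcome in arm B minus mean outcome in arm A."""
    mean_a = statistics.mean(outcomes[unit] for unit in arm_a)
    mean_b = statistics.mean(outcomes[unit] for unit in arm_b)
    return mean_b - mean_a


def permutation_p_value(outcomes, arm_a, arm_b, rng, n_permutations=2000):
    """Two-sided p-value from randomly relabeling which units are in which arm."""
    observed = abs(difference_in_means(outcomes, arm_a, arm_b))
    units = sorted(arm_a | arm_b)
    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(units)
        fake_a = set(units[:len(arm_a)])
        fake_b = set(units[len(arm_a):])
        if abs(difference_in_means(outcomes, fake_a, fake_b)) >= observed:
            extreme += 1
    return extreme / n_permutations


rng = random.Random(0)  # fixed seed so the sketch is reproducible
facilities = list(range(200))  # hypothetical facility identifiers
on_site, self_audit = assign_arms(facilities, rng)

# Simulated outcome: change in remotely measured emissions per facility.
# The assumed -5.0 shift for self audits is invented purely for illustration.
emissions_change = {
    facility: rng.gauss(-5.0 if facility in self_audit else 0.0, 2.0)
    for facility in facilities
}

effect = difference_in_means(emissions_change, on_site, self_audit)
p_value = permutation_p_value(emissions_change, on_site, self_audit, rng)
```

Because assignment is random, the two arms are comparable on average, so the difference in means can be read as the effect of the auditing practice rather than of which facilities happened to receive it. With real data the outcomes would come from monitoring records instead of a simulation.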

Despite the benefits of adding experi- 
mental variation to program implementa- 
tion, as demonstrated in nonenvironmen- 
tal contexts such as health and education, 
environmental organizations rarely do 
so. Consider two US federal agencies with 
substantial environmental program port- 
folios: the US Environmental Protection 
Agency (USEPA) and the US Department 
of Agriculture (USDA). In the past 30 years, 
each has embedded formal experimenta- 
tion in their environmental programs fewer 
than a half dozen times. In Europe, we know 
of only a single example of formal experi- 
mentation embedded within government- 
implemented environmental programs (2). 
Formal experimentation is similarly almost 
nonexistent among nongovernmental and 
multilateral environmental organizations. 
Although environmental actors engage in 
thousands of informal “experiments” every 


designed to test the implicit hypotheses 
that justify the implementation of current 
programs or understand how to make these 
programs more effective. 

Formal experimentation in environmental 
programs is absent because science typically 
stops when implementation starts. Over the 
past five decades, governmental and non- 
governmental actors have invested substan- 
tial resources to understand the status and 
trends of myriad environmental indicators. 
These investments have been motivated by 
scientific uncertainty about how complex 
environmental systems function and by a 
recognition that reducing this uncertainty is 
critical to designing effective programs. 

Yet uncertainty also plagues program effi- 
cacy. The coupled natural-human systems in 
which environmental programs are imple- 
mented are complex, and our understand- 
ing of how programs influence the trajec- 
tory of these systems is incomplete. When 
new program designs in nonenvironmental 
contexts are assessed through formal exper- 
imentation, proponents often learn that the 
innovations fail to have the intended effects 
(3). Scientists and practitioners should not 
expect innovations in environmental pro- 
grams to be any different. 

The absence of experimentation within 
environmental programs can be explained 
in part by historical reasons. Compared 
with other social policy fields such as health, 
poverty, and education, the environmental 
policy field is much younger and would be 
expected to be a late adopter of innovative 
ways of generating evidence. Moreover, the 
human benefits from effective environmen- 
tal programs are less salient than in other 
social policy fields. The foregone benefits 
from ineffective programs are also less sa- 
lient, putting less pressure on program staff 
to show effectiveness. Last, environmental 
practice is dominated by lawyers, engi- 
neers, and natural and physical scientists 
who—unlike health, behavioral, and social 
scientists—do not typically use experimen- 
tal designs in real-world contexts and may 
not anticipate complex human responses to 
what seem like straightforward policy and 
program decisions. Yet there are no struc- 
tural barriers to experimentation in the en- 
vironmental field. 


CONCERNS ABOUT EXPERIMENTATION 
Four primary concerns about embedding 
formal experimentation into environmental
programs need to be addressed:
delayed action, feedback lags, struc- 
tural barriers, and ethical questions. 

First, identifying ways to create 
experimental variation and measure 
outcomes can delay scaled-up imple- 
mentation, letting environmental 
damages accumulate. Yet for the 
case of ineffective programs, the ac- 
cumulated damages could be much 
larger when program managers rely 
on retrospective evaluations that use 
nonexperimental, postimplementa- 
tion data. The costs of delays from 
experimentation will depend on how 
effectively the program meets its ob- 
jectives and how quickly damages 
accumulate. In some cases, large- 
scale action may be required without 
waiting for experimentation (akin to 
“emergency authorizations” in medi- 
cine). Yet we believe that in many 
cases, experimentation embedded 
in program implementation will im- 
prove outcomes in the long run, even 
at the cost of some delay in the short 
run. Similar arguments have been 
made in the recent COVID-19 pan- 
demic, in which calls for quick action 
and for rigorous evidence seemed to 
be in opposition (4). 

Second, the full effects of a pro- 
gram may not materialize for many 
years (for example, long-run cli- 
mate impacts), and the evidence 
may no longer be useful by the time 
it is available. Yet for many envi- 
ronmental problems, the culprit is 
human behavior, for which the de- 
sired changes can be measured on 
shorter timescales (for example, changes in 
energy consumption by households or fertil- 
izer use by farmers). Measures of short-term 
environmental indicators along the hypoth- 
esized causal path may also help elucidate 
whether the intervention is working as in- 
tended (for example, measure pollutants 
that change relatively rapidly rather than 
health conditions that change more slowly). 

Third, structural constraints, such as 
legal and regulatory rules, may present 
barriers to experimentation. The degree 
to which such barriers exist, however, is 
difficult to ascertain given that there has 
been so little historical effort allocated to 
experimentation. 

A fourth concern may seem on the sur- 
face to be the most problematic: Opponents 
of experimentation question the ethics of 
treating some people (or nonhuman organ- 
isms or ecological communities) differently 
than others (5). This concern arises from a 
presumption that those exposed to a pro- 
gram, or a specific version of it, are sure to 
A culture of experimentation within environmental programs

To reduce pollution, regulators can increase on-site inspections, 
or they can increase opportunities for facilities to do self audits, 
with some penalty leniency when violations are self reported. Self 
audits may be less effective at reducing pollution (measured 
remotely) than on-site inspections because self audits allow facilities 
to hide their noncompliance. Yet self audits may be more effective 
because they make facilities more aware about the law and its 
relationship to their operations and because they transform errors 
of omission into errors of commission. 


Air polluting facilities before program change 
Randomly split population into two groups


By randomly varying how the regulator interacts 
with polluting facilities, the regulator can not 
only learn about the relative effectiveness of 
each form of interaction but can also elucidate 
what drives facilities to comply or not with 
environmental regulations (for example, are they 
rational or imperfectly informed?). 


Key: pollution increase; pollution decrease


benefit from it. That assumption, however, 
is not necessarily true. The effects of many 
environmental programs are uncertain. 
One could argue that environmental or- 
ganizations have an ethical obligation to 
better understand the effects of untested 
programs, or changes in programs, before 
large groups of humans and other species, 
particularly vulnerable subgroups, are ex- 
posed to them (akin to the principle of 
“equipoise,” a state of genuine uncertainty 
about the comparative merits of different 
approaches, which is the ethical basis for 
justifying randomized treatments in medi- 
cal trials). Even programs that do not di- 
rectly harm the environment or people may 
simply be ineffective. Directing resources 
to ineffective interventions has substantial 
ethical implications, especially for environ- 
mental problems that are time sensitive, 
such as the loss of biological diversity and 
the accumulation of persistent pollutants. 
If environmental organizations were 
guided by an ethical precept that required evidence
before changing or scaling up
a program, the science and practice 
of environmental protection would 
look different and be more success- 
ful. Environmental programs would 
routinely be subjected to experimen- 
tation that deliberately manipulates 
the temporal and spatial variability 
of implementation. Program man- 
agers, perhaps in collaboration with 
academics, would then evaluate the 
results to better understand the con- 
sequences, intended and unintended, 
of the variations in implementation. 
This evidence would provide oppor- 
tunities to adjust and improve cur- 
rent and future programs (6, 7). This 
cycle of program innovation, experi- 
mentation, learning, and adaptation 
is a hallmark of evidence-based pro- 
grams in other fields. 


ENCOURAGING EXPERIMENTATION 
Although the constraints on engag- 
ing in experimentation will vary by 
organization, the opportunities for 
experimentation have some com- 
monalities. On the basis of experi- 
ences in other social policy fields, 
we offer four recommendations for 
expanding the opportunities for 
experimentation in environmental 
programs [for others, see (8, 9)]. 


Political and legal simplicity 
Running an experiment that con- 
trasts an entire program to a no-pro- 
gram control may require extensive 
legal and political approvals, as well 
as expose implementers to reputa- 
tional risks and coordination costs. Instead, 
one version of program implementation 
can be compared with another version by 
manipulating program attributes for which 
managers already have the authority to 
change (often called “A/B testing” in the 
private sector). For example, program man- 
agers could contrast the effects on pollution 
compliance from on-site inspections (status 
quo) versus remote inspections. Leveraging 
already-planned pilot programs can also be 
a practical way to facilitate learning when 
the pilot’s implementation is varied across 
space or time in ways unrelated to the pro- 
gram’s target outcomes. 


Financial simplicity 

Given that the additional costs of experimen- 
tation largely come from the costs of mea- 
suring outcomes, organizations can focus on 
contexts in which the outcomes are collected 
as part of program operations (such as pol- 
lution discharges) or are publicly available 
(such as satellite data of land use). 
Learning focused

To achieve higher returns on investment, 
organizations should focus on experimentation
that yields results that can be
generalized across multiple programs. Gen- 
eralizability is more plausible when the pro- 
gram features being manipulated are found 
in many programs (such as capacity build- 
ing and incentives) or motivated by similar 
theories of change. 


Partnership enhanced 

A quick, inexpensive way for environmental 
organizations to acquire the technical ca- 
pacity to design and analyze experiments, 
while keeping the operations in-house, is to 
embed trained experimentalists from out- 
side the organization (for example, through 
federal Volunteer Service Agreements in the 
US context). 

Strengthening the culture of experimenta- 
tion in the environmental community will 
require changes in norms and incentives. 
Program managers are often not rewarded 
for evidence about program effectiveness but 
rather for achieving other objectives (such as 
moving money to constituents, avoiding liti- 
gation by private actors, or pleasing funders). 
Nevertheless, changes in norms and incen- 
tives are occurring. One recent example of 
change is the creation of “behavioral insights 
teams” in governmental and multilateral or- 
ganizations. These teams help program man- 
agers to formally experiment with program 
changes inspired by insights from the behavioral
sciences (10).

For federal agencies in the United States, 
changes in norms and incentives are also 
occurring through the Foundations for 
Evidence-Based Policymaking Act of 2018 
(Evidence Act). The Evidence Act and com- 
plementary memoranda from the executive 
branch encourage a culture of experimenta- 
tion both directly and indirectly. They en- 
courage experimentation directly by empha- 
sizing the power and political acceptability of 
randomized implementation designs (11-14).
They encourage experimentation indirectly 
by requiring agencies to create annual learn- 
ing agendas and a strategy and budget to 
meet their agenda objectives. Learning agen- 
das comprise a set of questions that, when 
answered, are expected to have the biggest 
impact on an agency’s performance. Yet the 
Evidence Act and its associated guidance do 
not provide explicit rewards to staff for pos- 
ing substantive learning questions and using 
experimentation to generate high-quality 
answers to these questions. Thus, by itself, 
the Evidence Act may be insufficient to cre- 
ate a meaningful culture of experimentation 
within environmental agencies. 

One way to further foster a culture of ex- 
perimentation and embed learning in daily 
operations among US federal agencies would
be through a new executive order (EO) simi- 
lar in spirit to EO 12291 for Cost-Benefit 
Analyses. This new EO would be triggered 
if a new environmental program, or change 
in a current program, were to exceed a size 
threshold, which could be measured by 
program funding or the size of the affected 
population. The EO would require the imple- 
menting agency to first ascertain the equi- 
poise of the proposed program or change in 


Four conditions when 
experimentation pays off 


Pre-Change Ambiguity 

When theory and experience alone 
cannot unambiguously predict the ex- 
pected impacts of changes in program 
implementation 


Post-Change Ambiguity 

When estimating counterfactual out- 
comes in the absence of a change in 
program implementation is challeng- 
ing using traditional approaches 


High Implementation Cost 

When a change in program implementation
is unlikely to pass a benefit-cost
test, or cost-effectiveness assessment,
without medium or large impacts


Generalizability of Results 

When the lessons learned from experi- 
mentation are generalizable beyond 
the context in which the program 
change was implemented 


program: Is there strong empirical evidence 
that the proposed action is the best option? 
If not, then the agency would be required to
embed experimentation into the program 
with the intent of quantifying environmental 
and social impacts and understanding the 
mechanisms through which those impacts 
arise. The EO would require that agencies 
insert a step between proposing a program- 
matic change and scaling that programmatic 
change up to the entire eligible population. 
The EO would also encourage environmen- 
tal agency staff to involve statisticians and 
behavioral scientists before implementation. 
Currently, if these experts are called on at all, 
it is after implementation to assess what may 
have transpired—a challenging task when 
implementation was not designed to gener- 
ate evidence about impacts and mechanisms. 
In addition to characterizing what type of 
experimentation is acceptable, the EO would 
also have a stopping rule, similar in spirit to 
stopping rules used to decide when to end 
medical treatment trials. Likewise, the EO 


would also define when it may be acceptable 
to forego experimentation. 

Scientists and practitioners can legitimately
argue about the benefits and opportunity
costs of allocating scarce time and financial
resources to formal experimentation
in the environmental sector. Should half of 
environmental programs include experimen- 
tation? Is 10% the right amount? Although 
the optimal share is debatable, we believe 
that the current allocation of roughly 0% is 
suboptimal. How much experimentation is 
embedded in programs should depend on 
contextual attributes that make experimenta- 
tion most valuable (see the box). 

We recognize that experimentation is not 
the only way that a scientific lens can be ap- 
plied to improve our understanding of pro- 
gram implementation. Experimentation is 
best viewed as part of a mixed-methods ap- 
proach to generating evidence rather than 
as a substitute for more traditional ways of 
gathering evidence. Experimentation should, 
however, be a regular feature of programs, 
not a rarity. 
MEDICINE


Public health versus 
personalized medicine 


Environmental health research is being undermined 
by genomic medicine, argues a philosopher 


By Henry T. Greely 


The 20th-century journalist H. L.
Mencken wrote “there is always a
well-known solution to every human
problem—neat, plausible, and
wrong.” In his new book, Tyranny of
the Gene: Personalized Medicine and 
Its Threat to Public Health, philosopher 
James Tabery points to “personalized medi- 
cine” (by which he seems to mean genomic 
medicine) as an example. “My thesis,” he 
writes, “is that there have been and remain 
powerful financial, political, technological, 
and scientific forces that are driving this 
embrace of personalized medicine and pro- 
moting the idea of medicine as something 
genetic while simultaneously impeding 
the study of environmental determinants 
of wellness and disease.” Tabery supports 
this argument with nine chapters of lively 
history, each interweaving tales of environ- 
mental and genomic medicine. 

The first chapter opens in 1957 with 
Rachel Carson and the rise of environmen- 
tal health concerns. That same year, pio- 
neering medical geneticist Arno Motulsky 
published a paper about how individuals’ 
drug responses vary according to their 
genes. Tabery explores the tensions between 
these approaches to human health and the
two projects they engendered: a long-term 
research program named the National 
Children’s Study (NCS), which sought to 
determine the effects of environmental fac- 
tors on child health and develop- 
ment, and a plan for a massive 
genomic database to focus on ge- 
nomic health determinants. 

How the George W. Bush ad- 
ministration treated these ef- 
forts—discouraging the first, 
encouraging the second, but im- 
plementing neither—dominates 
the book’s middle chapters. 
Interspersed are fascinating 
asides on the sequencing giant 
Illumina’s history (it began as an
effort to make an artificial nose); about low- 
tech successes in combating sudden infant 
death syndrome; and about the imperialis- 
tic forays of genomics into archaeology and 
educational policy. 

The book also brings to life two cru- 
cial players: Duane Alexander, director of 
the National Institute of Child Health and 
Human Development from 1986 to 2009, 
and Francis Collins, director of the National 
Center for Human Genome Research (re- 
named the National Human Genome 
Research Institute) from 1993 to 2008 and 
of the entire National Institutes of Health 
(NIH) from 2009 to 2021. Alexander was the 
main NIH supporter of the NCS, whereas 


Tyranny of the Gene 
James Tabery 
Knopf, 2023. 336 pp. 


expressed support for a proposal to enroll a million 
Americans in a nationwide genomics research cohort. 


Collins was the force behind the genomic 
database. Tabery’s presentation of Collins— 
who could have been the book’s villain—is 
nuanced, if not altogether admiring. 

The book then moves to the Obama 
administration, which ended the debate 
by canceling the NCS and funding the 
genomic database. That database, now 
named “All of Us,” is nearly halfway to its 
goal of collecting health and genomic data 
on 1 million Americans. 

After a chapter on racial and ethnic dis- 
parities in genomic medicine, the book ends 
with a chapter decrying what Tabery calls 
“the Gleevec scenario.” Gleevec, introduced 
in 2001, is an effective drug against chronic 
myelogenous leukemia. Some heralded it as 
a harbinger of a precision medicine future. 
But Tabery says “it ushered in a new era 
of drugs that don’t work for most patients 
and are so exorbitantly priced that they risk 
bankrupting some of those who are biologi- 
cally eligible.” 

Tabery is clearly right that genomic 
medicine has overpromised and that envi- 
ronmental medicine has been underpur- 
sued. His analysis of the systemic reasons for 
the disparity—from commercial interests, to 
cheap genomic tools, to media and public ex- 
citement about genes—seems likely right. So 
does his disgust at narrowly targeted, over- 
priced treatments protected by 
patent games. 

And yet is personalized ge- 
nomic medicine a threat to public 
health, as the book’s subtitle pro- 
claims? It has helped some peo- 
ple, if only a tiny fraction of those 
in need. Its progress continues, 
albeit slowly, now even in sickle 
cell anemia—a prime example 
of racial disparities in personal- 
ized medicine. And while envi- 
ronmental public health efforts 
deserve more support, they have their own 
problems. As Tabery shows, the feasibility 
of the NCS was unclear when it was killed 
in 2014. Growing distrust of science will not 
have made it easier. Finally, although steep 
drug prices are a huge problem, they afflict 
many, but not all, interventions, both per- 
sonalized and not. 

Genomic personalized medicine has its 
merits, but it has not been the panacea its 
advocates projected. It probably never will 
be. Tabery’s excellent book argues power- 
fully for a more balanced approach to hu- 
man health research. 


10.1126/science.adj0281 
SCIENCE AND SOCIETY


Battling information bias


Do not wait for society to reject scientific disinformation,
tout the truth now


On Disinformation: How to Fight for Truth and Protect Democracy
Lee McIntyre
MIT Press, 2023. 184 pp.


By Jonathan Wai 


Most scientists have an enduring
belief that even if scientific truth 
is not embraced in the short 
term, it will be eventually. Lee 
McIntyre’s new manifesto, On 
Disinformation, packaged—like 
Mao’s—in a little red book, urges 
those who care about scientific truth and 
democracy to make a stand now rather than 
wait for society to come around. 

McIntyre begins by arguing that “seventy 
years of lies about tobacco, evolution, global 
warming, and vaccines” have brought us 
to the present moment of 
heightened disinformation 
because, throughout history,
“autocratic leaders and
their wannabes have under- 
stood that the quickest way 
to control a population is to 
control their information 
sources.” He summarizes 
the history of strategic de- 
nialism, describing, for ex- 
ample, how tobacco compa- 
nies specifically sought “to 
get the public to question 
the truth about something 
that scientists didn’t really 
question” (the health dan- 
gers posed by tobacco), and 
drawing parallels to strate- 
gies used by Donald Trump 
and those engaged in the 
“Make America Great Again” 
(MAGA) movement. 

Strategic denialism can 
only be successful if disinformation is 
created, amplified, and believed, writes 
McIntyre. He explains that modern disin- 
formation warfare is commonly practiced 
in Russian intelligence agencies, a phenom- 
enon described in greater detail in legal phi- 
losopher Scott Shapiro’s 2023 book, Fancy 
Bear Goes Phishing (1). 

In the second part of the book, McIntyre 
introduces “the amplifiers” of disinforma- 
tion—social media companies with busi- 
ness models that incentivize engagement 
over getting the facts right. The mainstream
media, he argues, has contributed
to spreading disinformation as well. When 
an audience comes away from a story less 
informed than when they started owing to 
information bias, this can be even more 
dangerous than when they are exposed to 
a politically biased message, he maintains. 
Undoing the algorithms of disinformation 
should therefore be a priority. 

In the book’s third section, McIntyre 
describes “the believers”—people who do
not understand how science works and are 
therefore most vulnerable to scientific disin- 
formation. Here he emphasizes that beliefs, 


Researchers must engage audiences they hope to persuade with respect, as scientist 
Katharine Hayhoe does when speaking about climate change. 


even those based on empirical evidence, are 
inextricable from values and identities. 

Face-to-face conversation is the best way 
to counter this phenomenon and change an 
individual’s mind, argues McIntyre. “It always 
happened in the exact same way,” he writes,
“through personal engagement with someone 
they already trusted or had grown to trust.” It 
is worth noting here that, although he spends 
a great deal of time interrogating the MAGA 
mindset, McIntyre spends less time exploring 
how an individual’s beliefs may arise not only 
from having been exposed to incorrect infor- 
mation but also as a consequence of political 
resentment that is the result of a fundamen- 
tal difference in worldview. 


Real change, McIntyre argues, likely will 
not come from better education or from ef- 
forts to improve critical thinking. Personal 
engagement seems to be crucial, but chang- 
ing one mind at a time is difficult to scale. 
Larger, structural solutions are likely 
needed, but McIntyre’s treatment does not 
dive deeply into what these solutions might 
be nor how they might be implemented. 
Media scholar Victor Pickard’s 2019 book 
Democracy Without Journalism? offers 
interested readers greater context for the 
larger structural issues at play and potential 
policy solutions (2). 

We must “confront the liars” and “heed 
history,” McIntyre advises,
bearing in mind the po- 
tential unintended conse- 
quences of our actions and 
reaching out to those who 
disagree with us. Yet he does 
not fully explore or explain 
how the actions we might 
take today would be differ- 
ent from any we have taken 
in the past. Pseudoscience 
has, after all, been with 
us throughout history (3). 
Treatments from cognitive 
scientists, such as the re- 
cently published Nobody’s 
Fool (4), might provide ad- 
ditional strategies for indi- 
vidual inoculation against 
disinformation. 

McIntyre concludes with 
ideas on “how to win the 
war on truth.” “We need to 
increase the number of messengers
for truth,” he argues. “We simply
need more of them.” Scientists can help by 
building relationships with those they seek 
to persuade and teaching them with respect 
and dignity.
The global impact of EU
forest protection policies 


The European Union’s Biodiversity and 
Forest Strategies for 2030 mandate protect- 
ing all remaining old-growth forests across 
the EU, increasing the area of habitat 
patches set aside within forests harvested 
for timber, and limiting clear-felling in 
timber-producing landscapes (1). Although 
saving old-growth forests is critical, stand- 
alone policies can produce unintended 
consequences (2). Without simultaneously 
reducing demand for forest products or 
increasing supply from plantations and 
secondary forests, such measures can lead 
to increased harvesting elsewhere, often 

in tropical countries, to accommodate 
demand. Shifting logging activities to 
countries with weaker legal protections 
aggravates biodiversity and carbon losses 
and exacerbates existing inequities in envi- 
ronmental burdens (3). Isolated policies 
displacing production will also undermine 
the EU’s recent Deforestation Regulation to 
halt imports of deforestation-linked tropi- 
cal products (4). 

EU policies have global effects. In 2022, 
the share of tropical wood and furniture 
imports into EU27 countries reached a 
15-year high of US$4.4 billion (5). The 
risk that EU harvesting restrictions will 
further shift harvesting pressures to the 
tropics is considerable. By 2050, logging 
limits under the EU Biodiversity Strategy 
could cut European roundwood production
by 42%, increasing Brazilian and
Malaysian non-coniferous roundwood 
extraction by 19% and 8%, respectively 
(6). China’s analogous ban on natural for- 
est harvesting led to a 15% increase in 
solid-wood imports (7), driving extraction 
into carbon-dense, endemic-rich frontiers 
in the Congo Basin (8). Meanwhile, recent 
European trade sanctions on Russia and 
Belarus have eliminated US$4.95 billion 
of timber imports to EU27 countries, 
driving a scramble for additional timber 
centered on the hyperdiverse tropics (5). 
Tropical harvests in old-growth forest 
cause disproportionate damage compared 
with temperate harvests as a result of 
higher diversity and sensitivity of tropical 
biota (9) and weaker governance in tropical
harvesting regions (10).

To avoid worsening its global footprint, 
the EU must urgently integrate better 
mapping and conservation of old-growth 
forests (11) with additional policies. EU 
countries should improve timber product 
longevity and develop resilient, higher- 
yielding plantations on existing degraded 
lands alongside ecological approaches 
that restore native forest while generating
timber (12). Better quantification of
the socio-environmental consequences 
of homegrown and imported timber (3) 
and robust harvesting safeguards in all 
timber exporting nations are also needed. 
Crucially, EU countries must carefully 
consider the global consequences of 
domestic forestry changes and logging 
moratoria. Protecting European forests
is laudable, but trading conservation in
Europe for far greater impacts in tropical
rainforests is unacceptable.

[Figure caption: EU policies have unintended consequences for tropical rainforests such as the Amazon.]

Solar energy projects put
food security at risk 


Solar photovoltaic deployment is essential 
to promote renewable energy transition, 
phase down coal-fired power plants, and 
achieve the Paris Agreement tempera- 
ture goals (1). However, large-scale solar 
photovoltaic deployment requires a vast 
amount of land, and a substantial number 
of solar photovoltaic projects have been 
built on farmland, threatening food security
(2, 3). Given the ambitious climate
pledges of signatory countries to the Paris 
Agreement, the area of land required to 
deploy global solar photovoltaics in the 
coming decades is expected to rise (4). 
Governments must act now to mitigate 
the fierce competition for land between 
solar energy and crops. 

Solar energy projects have encroached 
on farmland across the Northern 
Hemisphere (3). In 2017 alone, China 
deployed photovoltaic panels on about 100 
km² of farmlands in the North China Plain
(3), one of China’s most important agri- 
cultural regions. Solar photovoltaic panels 
have also been deployed over deserts, 
abandoned mines (5), artificial canals (6), 
reservoirs (7), and rooftops (8), but these 
options are less attractive to developers 
because they are more scarce, more unsta- 
ble, or more expensive than farmlands. 

To ensure national food security, some 
countries have released strict farmland 
protection regulations [e.g., China’s Basic 
Farmland Protection Regulations in 1994, 
Germany’s Federal Regional Planning Act 
in 1997, and South Korea’s Farmland Act in 
1994 (9)]. However, solar energy investors 
and developers continue to occupy farmland 
illegally (10). Local authorities provide inad- 
equate enforcement, allowing development 
to proceed at the expense of agriculture. 

Mitigating solar energy’s land competi- 
tion will require technological innovation 
and more sustainable deployment strate- 
gies. For example, agrivoltaic systems have 
been proposed that would allow crops to 
grow under solar panels (11). However, the
solar panels hinder mechanized farming 
and harvesting, and the solar photovolta- 
ics need to be deployed at a position much 
higher than crops, making the project 
more expensive. Scientists have also devel- 
oped foldable solar cells that can be inte- 
grated into buildings (12). 

Until these technologies are cost-effective 
and scalable, governments should preferen- 
tially use unproductive lands for large-scale 
photovoltaic deployment, prevent instal- 
lations on finite arable land, and provide 
stricter enforcement of farmland protection 
policies. Satellite remote sensing tech- 
nologies should be used to closely monitor 
solar photovoltaic panels’ illegal farmland 
encroachment and quantify their impacts 
on food production. Illegally deployed solar 
photovoltaics should be demolished so that 
farmland can be restored. Governments, 
corporations, and nonprofit organizations 
should also provide funding to scientists 
to research and develop cost-effective, eco- 
friendly, energy-efficient solar cells, includ- 
ing agrivoltaic technology. Scientists should 
also work to better understand the adverse
and unintended consequences of large-scale 
solar photovoltaic deployment to ensure 
that the technology provides net benefits in 
the future. 

Save China’s gaurs


The gaur (Bos gaurus), the largest living 
bovine species, primarily inhabits tropical 
and subtropical broadleaf forests, bamboo 
forests, and sparsely tree-covered grasslands
(1). In China, the species is mainly
found in Xishuangbanna Prefecture in 
Yunnan Province (1, 2). Anthropogenic 
changes have brought this population to 
the brink of extinction. China must take 
action to save this vulnerable megafauna. 

Since the 1950s, crop cultivation has 
expanded in Yunnan, resulting in the 
replacement of natural forests (3, 4). In 
some cases, these cultivated lands have even 
encroached into natural reserves (3, 5). As a 
result, the gaur has lost a large area of habi- 
tat, likely forcing the population to relocate 
to steeper natural forest areas (4, 6). 

In addition to habitat loss and fragmenta- 
tion, indiscriminate hunting and illegal trade 
have contributed to the substantial decline 
of the gaur population (7). Between 1979 and 
1985, a staggering total of 83 individuals 
were killed by hunters in Xishuangbanna (1). 
The total gaur population in Xishuangbanna 
has declined from between 605 and 712 
individuals in 1984 to an estimated 152 to
167 individuals in recent decades (1, 6). In
Menglun, Xishuangbanna, gaurs are func- 
tionally extinct (7). 

The gaur is included on the National 
Class I key protected wildlife list (2) and 
classified as Critically Endangered on the 
Red List of China’s Vertebrates (8). To pro- 
tect the remaining gaurs, China has desig- 
nated the species as a conservation priority 
in multiple natural reserves (9) and used
technologies such as infrared cameras to
monitor them in real time and assess their 
population dynamics and behaviors (10).
However, these efforts are insufficient. 

The fragmented habitats within natural 
reserves should be restored immediately 
to natural forests. In areas outside natural 
reserves where the gaur frequently roams 
(11), poaching should be prevented by 
means of increased penalties and enforce- 
ment. The Chinese government should 
offer subsidies and tree-planting training 
programs to incentivize farmers to engage 
in converting farmland to forests, with 
rewards based on their farmland area and 
the number of trees planted. Establishing 
an ecological compensation mechanism 
could enable farmers to participate in ani- 
mal conservation efforts and receive cor- 
responding allowances. Lastly, given that 
the gaur has a wide range of activity and 
migratory habits that allow individuals and 
populations to move based on the weather 
conditions and the availability of food and 
water (4, 11, 12), assisted migration may be 
feasible. When gaurs are trapped in unsuit- 
able locations, unable to migrate due to 
barriers like villages and highways, trans- 
locating some individuals to sparsely popu- 
lated and environmentally suitable areas 
could be successful. Without substantial 
additional conservation strategies, the gaur 
could soon go extinct in China. 
Tao Xiang 
Laboratoire Evolution et Diversité Biologique, 
UMR5174, Université Toulouse III-Paul Sabatier,

STELLAR ASTROPHYSICS


Helium star that will 
become a magnetar 


Magnetars are neutron stars
with extremely strong
magnetic fields, the origin of
which is debated. One possibility
is amplification of a
magnetic field in the core of the parent
star, which produces the neutron star
during a supernova explosion. However,
such fields have not been observed in
stars that are about to explode. Shenar et
al. used spectropolarimetry to identify a high
magnetic field in a Wolf-Rayet, the exposed
helium core of a star that has lost its outer layers
of hydrogen. The mass of the Wolf-Rayet is high
enough that it will produce a neutron star in a
supernova, and the field is sufficiently strong to generate a
magnetar during core collapse. —KTS


Science, ade3293, this issue p. 761 


PHYSICS 
Peculiar rotations in
fullerenes

Ergodicity breaking, the inability 
of a system to thermalize, is of
fundamental interest in statisti- 
cal mechanics and physics and 
has been vigorously studied in 
many systems. Using high-sensi- 
tivity infrared spectroscopy, Liu 
et al. collected compelling exper- 
imental signatures of previously 
unobserved rotational ergodicity 
breaking in a buckminsterfullerene
molecule (C₆₀) that arose
from rotation-vibration coupling 
and were distinctly different from 
what was found in all prior stud- 
ies. Because of its symmetry, 
size, and rigidity, C₆₀ can switch
back and forth between ergodic 
and non-ergodic rotational 
regimes as it rotates faster and 
faster. The present work reports
peculiar rotational dynamics in
C₆₀ and demonstrates how such
a familiar but relatively unex- 
plored molecule can be used to 
observe new phenomena. —YS 
Science, adi6354, this issue p. 778 


NANOMATERIALS 
Metal clusters as active 
waveguides 


Optical waveguides are often 
passive devices that rely on 
changes in the refractive index 
of materials to confine and guide 
light, but they can also function 
through optically active materials
that absorb and re-emit light.
Wang et al. show that crystals 
of small metal clusters such
as PtAg₁₈ protected by ligand
shells can exhibit high optical
waveguide performance with low
optical loss coefficients and high 
polarization of the re-emitted 
light. These effects are ampli- 
fied by favorable crystal packing 
and molecular orientation 
effects. —PDS

Science, adh2365, this issue p. 784 


PHYSICS 
Disorder to the rescue 


Many correlated electron 
systems, such as cuprates
and heavy fermion materials,
host an unusual type of metal- 
lic state called strange metal. 
Strange metals have transport 
and thermodynamic properties 
with temperature dependences 
that differ from those of ordinary 
metals. Devising a theory that 
describes all of these properties 
correctly remains challenging. 
Patel et al. achieved this goal
by introducing disorder in the
coupling constants of a model of
strongly interacting systems. —JS
Science, abq6011, this issue p. 790 


BIOPHYSICS 
The search for Piezo 
channel partners 


Piezo ion channels act as sen- 
sors that convert mechanical 
energy into cellular signals, but 
the mechanistic details of their 
modulation remain to be fully
elucidated. Zhou et al. used two 
molecular strategies in combina- 
tion with CRISPR/Cas9 editing 
to identify previously unrecognized
Piezo channel–binding
partners. Using a blend of elec- 
trophysiology and cryo-electron 
microscopy, the authors provide 
molecular details of how MyoD
family inhibitor proteins interact 
with Piezo channels and regulate 
their gating. They identified an 
interacting interface that likely 
represents a scaffold that might 
be leveraged to generate Piezo-targeted
therapeutics. —MMa
Science, adh8190, this issue p. 799 


Protective parasites 
Parasitic helminths generate a 
type 2 immune response that 
can persist even after clearance 
of an infection. Using a mouse 
model of roundworm infection, 
Oyesola et al. found that prior 
exposure to lung-migrating 
helminths protects transgenic 
K18-hACE2 mice against SARS- 
CoV-2 infection. Pulmonary 
macrophages from roundworm- 
infected mice adopted a type 2 
transcriptional and epigenetic 
signature that persisted after 
parasite clearance and at
least 45 days after infection.
SARS-CoV-2-specific CD8+ T
cell responses were driven by 
alveolar macrophages and were 
required for helminth-mediated 
protection. These results dem- 
onstrate that lung-migrating 
helminths reprogram lung 
immune homeostasis, leading 
to enhanced protection against 
subsequent SARS-CoV-2 infec- 
tion. —CO 


Sci. Immunol. (2023) 
10.1126/sciimmunol.adf8161 


Targeting senescence 
with CAR T cells 


Cellular senescence con- 
tributes to aging and to 
aging-associated diseases, 
and depletion of senescent 
cells has shown potential for 
treating such diseases. Yang 
et al. developed chimeric 
antigen receptor (CAR) T cells 
targeting Natural Killer Group 
2 Member D (NKG2D) ligands, 
which are highly expressed
on senescent cells. The CAR,
which is composed of the
extracellular domain of NKG2D
and the intracellular signaling
domains of two receptors, was
able to efficiently drive killing of
senescent cells in vitro and in 
vivo in both aged mice and aged 
nonhuman primates. Aged mice 
treated with NKG2D-CAR T cells
exhibited improved physical 
performance after treatment. 
These data support the further 
development of NKG2D-CAR 
T cells for aging-associated 
pathologies. —CM 
Sci. Transl. Med. (2023) 
10.1126/scitranslmed.add1951 


Two-state two-step 
Natural proteins often adopt 
multiple conformational states, 
thereby changing their activity or 
binding partners in response to 
another protein, small molecule, 
or other stimulus. It has been 
difficult to engineer such con- 
formational switching between 
two folded states in human- 
designed proteins. Praetorius et 
al. developed a hinge-like protein 
by simultaneously consider- 
ing both desired states in the 
design process. The successful 
designs exhibited a large shift in 
conformation upon binding to a 
target peptide helix, which could 
be tailored for specificity. The 
authors characterized the pro- 
tein structures, binding kinetics, 
and conformational equilib- 
rium of the designs. This work 
provides the groundwork for 
generating protein switches that 
respond to biological triggers 
and can produce conformational 
changes that modulate protein 
assemblies. —MAF 

Science, adg7731, this issue p. 754 


Illustration of a designed protein 
that can switch between two different 
conformational states 
ENERGY


Another route
to refrigeration


Space cooling accounts for 20% of
the world’s total energy consumption
but relies on refrigerants with
large global-warming potential.
Zhou et al. now demonstrate an
alternative to refrigerant-based cooling
that uses a compression-based
regenerative elastocaloric device
that performs phase transformations
during which latent heat is absorbed.
This tubular nickel-titanium device
achieves a temperature difference
between the hot and cold sides of 50
kelvin and a cooling power of more
than 200 watts. —ECF


Joule (2023) 10.1016/j.joule.2023.07.004 


Elastocaloric cooling offers an attractive
alternative to the refrigerant-based technology
now used in most refrigerators.


Spartin mediates 
lipophagy 


Lipid droplets are important 
for energy storage and lipid 
homeostasis. Selective 
autophagy of lipid droplets, 
called lipophagy, is required 
for their catabolism, but how 
lipid droplets are targeted for 
lipophagy remains unclear. 
Working in cultured human 
cells, Chung et al. elucidated a 
role for spartin, a protein that 
localizes on lipid droplets and 
endosomal compartments,
in lipophagy. Spartin functions
as a receptor on lipid droplets
that interacts with the core
autophagy machinery. Thus,
spartin is required to deliver
lipid droplets to lysosomes for
triglyceride mobilization.
Mutations in the gene encoding
spartin lead to Troyer syndrome,
a form of hereditary spastic
paraplegia. Indeed, blocking
spartin function caused lipid
droplet and triglyceride
accumulation in mouse and
cultured human neurons. —SMH
Nat. Cell Biol. (2023)
https://doi.org/10.1038/s41556-023-01178-w


PALEONTOLOGY
A fiery demise


It is well known that many large 
vertebrate species went extinct 
during the late Pleistocene in 
most regions of the world. What 
caused these extinctions remains 
debated, although both climate 
change and human impacts have 
been implicated. O'Keefe et al. 
used the extensive fossil record 
created by the entrapment of 
animals in the La Brea tar pits in 
conjunction with nearby core sam- 
ples and found a clear relationship 
between an increase in fire—and 
fire-related ecosystems—and 
large mammal extinction. The 
authors argue that this increase 
in fire may have resulted from 
climate change-induced warming 
and drying in conjunction with 
increasing impacts of humans in 
the system. —SNV 

Science, abo3594, this issue p. 746 


HUMAN DEVELOPMENT 
A yolk masterstroke


The yolk sac (YS) is a membra- 
nous structure attached to the 
developing embryo that acts to 
support hematopoiesis, metabo- 
lism, and coagulation. Our current 
understanding of its role in human 
development has been limited by 
a reliance on nonhuman model
systems. Goh et al. used single- 
cell RNA sequencing, light-sheet 
microscopy, and RNAscope-based 
in situ hybridization to generate a 
human YS atlas derived from 10 
samples spanning 4 to 8 postcon- 
ception weeks or Carnegie stages 
10 to 23. The authors found that 
endoderm in human YS, but not in 
murine YS, provides hematopoi- 
etic growth factors. Furthermore, 
in humans, the YS is the dominant 
source of early erythropoiesis, 
unlike in mice, in which the liver 
also plays a role. Finally, the 
authors report that human YS can 
fuel the accelerated production of 
macrophages from hematopoietic 
stem and progenitor cells inde- 
pendently of monocytes. —STS 
Science, add7564, this issue p. 747 
CANCER
Using a chaperone to
fight cancer
KRAS is one of the most 
common oncogenes, but unfor- 
tunately it is also commonly 
thought of as “undruggable” 
because it lacks a suitable bind- 
ing pocket for small-molecule 
drug candidates. To get around 
this limitation, Schulze et al. 
built on observations from 
natural product-derived drugs
to go after oncogenic KRAS 
indirectly (see the Perspective 
by Liu). The authors identified a 
naturally occurring compound 
that binds cyclophilin A, a type 
of cellular chaperone, and then 
modified this compound to also 
bind oncogenic mutant KRAS 
in a three-way complex. The 
authors used this approach to 
design multiple small mol- 
ecules that effectively bound 
mutant KRAS in complex with 
cyclophilin A. These molecules 
were very effective at inhibit- 
ing the downstream pathways 
involved in cell proliferation and 
at suppressing tumor growth in 
multiple models. —YN 
Science, adg9652, 
this issue p. 794; 
see also adj1001, p. 729 


OPTICS 
Making superlenses super 


In most imaging techniques, the 
smallest feature sizes that can 
be resolved are on the order of 
the wavelength of light used for 
illumination. Several techniques 
have been developed that can 
resolve subwavelength features. 
Of these, superlenses fabricated 
from plasmonic materials and 
metamaterials suffer optical 
losses that limit their perfor- 
mance. Guan et al. show that 
illumination with synthetic frequency
waves, waveforms that
are temporally attenuated,
can retrieve subwavelength 
features, thus making their 
superlenses truly super. This 
work demonstrates a practical
approach to overcoming
intrinsic losses in plasmonic
systems and shows that there
is potential for substantial
improvements in imaging and
sensing capabilities. —ISO
Science, adi1267, this issue p. 766


MOLECULAR BIOLOGY 
How POT1 protects
human telomeres 


Mammalian chromosomes are 
capped with telomeric DNA that 
is mostly double-stranded but 
terminates in a single-stranded 
overhang. A multiprotein 
complex called shelterin coats 
telomeric DNA to protect 
chromosome ends from being 
recognized as DNA breaks. 
Tesmer et al. have revealed that 
the human shelterin protein 
POT1 protects the telomeric 
double-stranded-single- 
stranded DNA junction. Only 
one of the two mouse POT1
proteins binds this junction, which
explains its sufficiency for end 
protection. Disrupting junction 
binding of human POT1 alters
the 5’ terminus sequence and 
marks chromosome ends as 
sites of DNA damage, demon- 
strating the importance of this 
interaction for chromosome end 
protection. —DJ 

Science, adi2436, this issue p. 771 


BIOPHYSICS 
Bacterial degradation of 
hydrocarbons 


Certain marine bacteria can 
degrade small-molecule hydro- 
carbons, but there is still limited 
understanding of how this process
works in biofilms. Prasad et
al. show that one such bacte- 
rium, Alcanivorax borkumensis, 
initially forms a spherical biofilm 
around a droplet of hexadecane, 
which subsequently grows and 
buckles (see the Perspective
by McGenity and Laissue). This
transition is caused by an initial
limited interaction with the oil by
only some of the cells, followed
by rapid cell growth and division
that distorts the shape of the
biofilm, leading to an increase in 
the surface area and accelera- 
tion in the rate of consumption. 
Adhesion of aligned rod-shaped 
bacterial cells to the oil stabilizes 
the tube-like protrusions. The 
addition of surfactants com- 
monly used to disperse marine 
oil spills disrupts the biofilms, 
and the cells return to their 
spherical morphology. —MSL 
Science, adf3345, this issue p. 748; 
see also adj4430, p. 728 


CORONAVIRUS 
Coopting virus- 
neutralizing antibodies 


The emergence of severe 
acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) 
stimulated interest in repur- 
posing neutralizing antibodies 
against related viruses such as 
SARS-CoV-1, which caused a 
smaller outbreak in 2002-2004. 
Zhao et al. sought to engineer 
mutations in a SARS-CoV-1-neu- 
tralizing monoclonal antibody 
isolated from a convalescent 
donor to enable this antibody 
to neutralize SARS-CoV-2. A
candidate engineered antibody 
blocked SARS-CoV-2 infection 
of cells in vitro and prophylacti- 
cally protected hamsters from 
viral challenge. These results 
highlight the potential of this 
approach for refocusing an 
existing antibody to neutralize a 
related virus. —JFF 
Sci. Signal. (2023) 
10.1126/scisignal.abk3516 


CELL BIOLOGY 
Taking cell atlases 
a step further 


Cell atlases categorize single 
cells into different types on the 
basis of molecular features, 
including the transcriptome and 
epigenome. However, diversity 
within the categories in these 
static atlases has raised
questions about how best to define
cell types. In a Perspective, Fleck 
et al. suggest that cell type is
not only defined by the developmental
history of the cell and its
current set of features but is also
linked to how the cell responds
to perturbation. The authors 
propose that atlasing these 
perturbation responses, which 
together are referred to as the 
phenoscape, in primary tissue 
and in three-dimensional human 
cell culture models could provide 
a more complete understanding
of cell types. —SAL 

Science, adf6162, this issue p. 733 


A MASTerful compilation


Mast cells (MCs) are a class
of granulocytes that release
inflammatory mediators in
response to an array of stimuli.
MCs are currently organized into
mucosal MCs (MMCs), found in
the gut and lung, and connective
tissue-type MCs (CTMCs),
located primarily in the skin
and peritoneal cavity. To gain
a picture of the heterogeneity
and ontogeny of MC subsets,
Tauber et al. integrated several
single-cell datasets of mouse and
human MC populations across
a variety of tissues. The authors
found in mice that Mrgprb2+ and
Mrgprb2- MCs are transcriptionally
distinct populations
that generally recapitulate the 
CTMC-MMC dichotomy. Seven 
other distinct MC transcriptomic 
profiles in humans were also 
apparent, although whether 
these are transcriptomic states 
or genuine MC subsets remains 
an open question. —STS 
J. Exp. Med. (2023) 
10.1084/jem.20230570 


CONSERVATION 
Sequencing a sprinkle 
of shark dust 


Genomics is growing increas- 
ingly important for identifying 
endangered species that have 
been traded illicitly, but many 
sequencing methods are 
expensive or species specific. 
Prasetyo et al. performed 
meta-barcoded high-throughput 
sequencing of trace DNA from 
seven locations in Indonesia
engaged in illegal trading of
sharks and rays. They were able 
to assign most of these reads 
to individual species, but some 
could only be mapped to family 
or genus. Although there was 
good agreement between the 
species identified using dust and 
concomitantly collected tissue 
samples, the authors identified 
more species using dust. Both 
methods, however, identified 
multiple endangered species 
being processed in these loca- 
tions, demonstrating the need 
for better and cheaper genetic 
tools with which to monitor the 
illegal wildlife trade. —CNS 
Conserv. Lett. (2023) 
10.1111/conl.12971 


PHOTOCHEMISTRY 
Stabilizing excited 
molybdenum complexes 


Complex ligands are usually 
needed to stabilize long-lived
photoexcited states of earth-
abundant metals. Kitzmann 
et al. found that a simple 
tripodal polypyridyl ligand, 
1,1,1-tris(pyrid-2-yl)ethane, 
stabilizes a molybdenum tri- 
carbonyl complex that exhibits 
intense deep-red phosphores- 
cence with a lifetime of several 
hundred nanoseconds. The 
lack of trans carbon monoxide 
ligands appears to limit elonga- 
tion of the molybdenum-carbon 
bonds and loss of carbon mon- 
oxide in the excited state. This 
complex was used to upconvert 
green photons to blue photons 
and to drive a dehalogenation 
reaction with green light. —PDS 
J. Am. Chem. Soc. (2023)
10.1021/jacs.3c03832 


BEHAVIOR 
Strong storage 


Squirrels are known for their 
caching behavior. Throughout 
the northern hemisphere, 
squirrels are seen digging and
burying nuts all summer in 
preparation for winter. As a 
widely distributed lineage, how- 
ever, most squirrels live in warm 
tropical regions, where burying 
may result in nut deterioration. 
Noticing nuts suspended in the 
crooks of trees and shrubs in 
Hainan Province, China, Xu et 
al. discovered that at least two 
species of flying squirrel have 
come up with a storage solution 
suitable for wet tropical climates. 
Video footage revealed that 
Hylopetes phayrei electilis and 
Hylopetes alboniger used their 
incisors to chisel grooves into 
nuts that had been primarily 
harvested from Cyclobalanopsis 
trees. The carved nuts allowed 
secure storage by acting as 
a tight mortise-tenon joint at 
branch nodes. —SNV 

eLife (2023) 10.7554/eLife.84967 


WATER 
Water dissociation 
at the nanoscale 


Despite considerable inter- 
est in understanding water 
behavior at the nanoscale 
regime, the role of confine- 
ment on the chemical reactivity 
properties of water, such as its 
self-dissociation, remains poorly 
understood, and various confus- 
ing hypotheses have been made 
based on indirect experimen- 
tal evidence and simplified 
theoretical conceptions. Using 
rigorous first-principles molecu- 
lar dynamics simulations, Di 
Pino et al. demonstrated that 
the initial break of the
oxygen-hydrogen covalent bond
determines the energetics of 
bulk water dissociation, which 
unexpectedly does not translate 
to small system dimensions of 
only a dozen molecules or pores 
of widths below 2 nanome- 
ters, in the absence of specific 
interfacial interactions. This 
work provides a comprehensive 
picture of water dissociation 
at the nanoscale, which should 
help to eliminate some of the 
contradictions in previous find- 
ings. -YS 

Angew. Chem. Int. Ed. (2023)
10.1002/anie.202306526
Pre-Younger Dryas megafaunal extirpation at
Rancho La Brea linked to fire-driven state shift


F. Robin O’Keefe*, Regan E. Dunn, Elic M. Weitzel, Michael R. Waters, Lisa N. Martinez,
Wendy J. Binder, John R. Southon, Joshua E. Cohen, Julie A. Meachen, Larisa R. G. DeSantis,
Matthew E. Kirby, Elena Ghezzo, Joan B. Coltrain, Benjamin T. Fuller, Aisling B. Farrell,
Gary T. Takeuchi, Glen MacDonald, Edward B. Davis, Emily L. Lindsey


INTRODUCTION: At the end of the Pleistocene, 
most of Earth’s large mammals (megafauna) 
became extinct. These extinctions occurred at 
different times globally, resulting in a drastic 
reorganization of terrestrial ecosystems. Despite 
decades of research on extinction causality, the 
relative importance of late-Quaternary climate 
changes and spreading human impacts has
been difficult to disentangle because poor chro- 
nological resolution in the fossil record has 
precluded alignment of these rapidly occurring, 
tightly linked phenomena. 


RATIONALE: The Rancho La Brea (RLB) locality 
in Southern California provides a unique oppor- 
tunity to investigate decadal-scale changes in 
megafaunal populations and community com- 
position across the latest Pleistocene. At this site, 
naturally occurring asphalt seeps entrapped and 
preserved the bones of hundreds, and in some 
cases thousands, of individuals from numerous 
megafaunal species across the last 50,000 years 
of the Pleistocene. Nearly all of these osteological 
specimens preserve original collagen, which 
permits precise radiocarbon dating analysis. 


Sequence of ecological
events as recorded at
Rancho La Brea, California.
Top left: conditions around
the tar pits were moist
and cool, with abundant trees
and megafaunal mammals.
Bottom left: the onset of
postglacial warming and
drying begins as human
pressure on herbivores
increases. Top right: the
synergy between climatic and
human impacts enables a
sudden ecological state
transition characterized
by unprecedented fire activity.
Bottom right: a chaparral
ecosystem is established;
megafauna are extinct, and
only coyote entrapment
continues at the tar pits.




O’Keefe et al., Science 381, 746 (2023) 


18 August 2023 


RESULTS: We obtained radiocarbon dates on 
172 specimens from seven extinct and one ex- 
tant species: Smilodon fatalis, Aenocyon dirus, 
Panthera atrox, Bison antiquus, Equus occidentalis, 
Paramylodon harlani, Camelops hesternus, and 
Canis latrans, spanning 15.6 to 10.0 thousand 
calendar years before present (ka). We used the 
resulting high-resolution chronology of entrap- 
ment at RLB to analyze population dynamics 
across this time interval and the timing of local 
disappearance for different taxa. To inves- 
tigate the potential roles of late-Quaternary 
environmental change and human activities 
in driving the observed patterns, we compared 
our analyses of population structure and 
megafaunal extirpation against well-resolved 
regional and continental paleoclimatic proxies, 
vegetation records, and modeled human dem- 
ographic growth. We used time-series mod- 
eling to investigate the dynamics of ecosystem 
change and evaluate causal relationships 
among these different phenomena. 

Modeling of extinction timing using several 
methods established that all taxa except coyotes 
were extirpated from RLB by 12.9 ka, before
the continental extinction of North American
megafauna. The disappearance of all taxa was 
synchronous except for camels and sloths, 
which disappeared a few hundred years ear- 
lier in concert with aridification and tree loss 
during the Bølling-Allerød. The simultaneous
disappearance of Smilodon, Aenocyon,
Panthera, Equus, and Bison antiquus coin- 
cided with a regional ecological state shift 
characterized by floral community reorgan- 
ization and unprecedented fire activity. Time- 
series modeling strongly implicates humans 
as the primary cause of the state shift and 
resulting extinctions. 


CONCLUSION: Our data document a transition 
from a postglacial megafaunal woodland to a 
human-mediated chaparral ecosystem in South- 
ern California before the onset of the Younger 
Dryas. This transition began with gradual open- 
ing and drying of the landscape over two mil- 
lennia, and terminated in an abrupt (300-year) 
regime shift characterized by the complete 
extirpation of megafauna and unprecedented 
fire activity. This state shift appears to have 
been triggered by human-ignited fires in an 
ecosystem stressed by rapid warming, a mega- 
drought, and a millennial-scale trend toward 
the loss of large herbivores from the landscape. 
This event parallels processes occurring in 
Mediterranean ecosystems today.
F. Robin O’Keefe*, Regan E. Dunn, Elic M. Weitzel, Michael R. Waters, Lisa N. Martinez,
Wendy J. Binder, John R. Southon, Joshua E. Cohen, Julie A. Meachen,
Larisa R. G. DeSantis, Matthew E. Kirby, Elena Ghezzo, Joan B. Coltrain,
Benjamin T. Fuller, Aisling B. Farrell, Gary T. Takeuchi, Glen MacDonald,
Edward B. Davis, Emily L. Lindsey


The cause, or causes, of the Pleistocene megafaunal extinctions have been difficult to establish, in
part because poor spatiotemporal resolution in the fossil record hinders alignment of species
disappearances with archeological and environmental data. We obtained 172 new radiocarbon dates
on megafauna from Rancho La Brea in California spanning 15.6 to 10.0 thousand calendar years before
present (ka). Seven species of extinct megafauna disappeared by 12.9 ka, before the onset of the
Younger Dryas. Comparison with high-resolution regional datasets revealed that these disappearances
coincided with an ecological state shift that followed aridification and vegetation changes during
the Bølling-Allerød (14.69 to 12.89 ka). Time-series modeling implicates large-scale fires as the primary
cause of the extirpations, and the catalyst of this state shift may have been mounting human
impacts in a drying, warming, and increasingly fire-prone ecosystem.


The disappearance of two-thirds of Earth’s
large mammals outside of Africa at the
end of the last Ice Age had profound impacts
on global ecosystems (1, 2), was the
largest extinction event of the Cenozoic
(3), and represents the initial pulse in the on- 
going global extinction crisis that will likely 
result in Earth’s sixth mass extinction (4). 
Across different continents, disappearances 
of megafauna (animals weighing >45 kg) co- 
incided with both late-Quaternary climate 
changes and human colonization and growth 
(5-7). However, the causes, dynamics, and con- 
sequences of the end-Pleistocene extinctions 
remain poorly understood despite their obvious 
relevance for modern global change research. 
These lines of inquiry have been hobbled by 
the lack of reliably dated megafaunal fossils
and the resulting lack of chronological precision
of extinction timing relative to environmental
and human demographic changes (8, 9).

We radiocarbon dated fossils from the Rancho 
La Brea Lagerstätte in Southern California to
investigate the timing and dynamics of Pleis- 
tocene megafaunal disappearance in this re- 
gion. The asphaltic deposits at La Brea preserve 
a nearly continuous record of megafaunal 
occupation of the Los Angeles Basin from 
>55 thousand calendar years before present 
(ka) through the Holocene (10), but a lack of
stratigraphic control has limited inferences 
about megafaunal population structure, their 
history, and their ultimate demise. We devel- 
oped a high-resolution radiocarbon chronology 
for the eight most common mammal species 
at La Brea [sabertoothed cat (Smilodon fatalis),
dire wolf (Aenocyon dirus), coyote (Canis latrans),
American lion (Panthera atrox), ancient bison 
(Bison antiquus), western horse (Equus occi- 
dentalis), Harlan’s ground sloth (Paramylodon 
harlani), and yesterday’s camel (Camelops 
hesternus)] from 15.6 to 10.0 ka (11) (Table 1). 
We estimated the timing of species disappearances
based on last appearance dates and statistical
modeling of extinction timing (11, 12).
We also inferred changes in entrapment rates 
over time using summed probability distrib- 
utions (SPDs). Sampling effort in our anal- 
ysis was approximately equal among species 
(rather than proportional to species occur- 
rence); therefore, SPDs reflect changes in 
intraspecific entrapment rates and relative 
population sizes over time, but not absolute 
population sizes (11). 
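
As a rough illustration of how a summed probability distribution can be built, the sketch below sums one probability density per dated specimen over a common grid of calendar years and then normalizes the total. It is a minimal Python sketch under stated assumptions, not the authors' protocol (11): each calibrated date is approximated by a Gaussian with a median and 1-sigma width, whereas real calibrated radiocarbon distributions are generally not Gaussian, and the function names and example dates are hypothetical rather than values from the study.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x (Gaussian stand-in for a calibrated date)."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def summed_probability_distribution(dates, grid):
    """Sum one density per dated specimen over a shared grid of calendar years BP.

    dates: list of (median_cal_years_bp, one_sigma_years) tuples (assumed Gaussian).
    grid:  list of calendar years BP at which the sum is evaluated.
    """
    spd = [0.0] * len(grid)
    for mean, sd in dates:
        for i, year in enumerate(grid):
            spd[i] += normal_pdf(year, mean, sd)
    return spd


def normalize(spd, step):
    """Rescale the curve so that it integrates to 1 over the grid (rectangle rule)."""
    total = sum(spd) * step
    return [value / total for value in spd]


# Hypothetical example dates for illustration only; these are NOT data from the study.
example_dates = [(13100, 60), (13400, 80), (14200, 100)]
grid = list(range(12000, 16001, 10))  # 10-year steps from 12,000 to 16,000 years BP
spd = normalize(summed_probability_distribution(example_dates, grid), step=10)
```

Because each specimen contributes one unit of probability before normalization, the shape of the resulting curve tracks how densely dates cluster in time, which is why such a curve speaks to relative rather than absolute abundance.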

To investigate how large mammals were af- 
fected by late-Quaternary environmental and 
anthropogenic changes, we aligned the La Brea 
record of faunal change with well-resolved 
Table 1. Last occurrences and modeled extirpation times for five extinct taxa at RLB. Dates are last occurrence in radiocarbon years (RC), calibrated
last occurrence in years before 1950, and statistical GRIWM estimates of extirpation time [see section 1 of the supplementary materials (11)]. Most densely sampled
taxa share a statistically identical extirpation time except for Camelops, which predeceases the others (**P < 0.0001, Welch's T). Panthera and Paramylodon have
insufficient sample sizes to permit meaningful numerical comparison, but Paramylodon may disappear relatively early.

Taxon | No. dated | Last occurrence, RC, ±30 | Last occurrence, ka 1-sigma | GRIWM extirpation estimate, ka 1-sigma
Aenocyon dirus | 29 | 11,135 | 13,082; 12,965-13,117 | 12,908; 12,763-12,996
[illegible] | [illegible] | [illegible] | 13,683; 13,595-13,770 | [illegible]
[illegible] | [illegible] | [illegible] | 13,021; 12,923-13,094 | 12,910; 12,821-12,998
[illegible] | [illegible] | [illegible] | 13,806; 13,995-13,763 | 13,707; 13,580-13,804
All extinct: 12,957; 12,894-13,066
Fig. 1. Record of relative megafaunal occurrence and extirpation at La Brea compared with the North
American Record. (A) Median calibrated dates for Camelops (n = 17), Bison (n = 27), Equus (n = 27),
Paramylodon (n = 8), Panthera (n = 8), Aenocyon (n = 29), Smilodon (n = 30), Canis latrans (n = 27), and
all extinct taxa pooled (n = 144) at La Brea. Hash marks are single, calibrated median carbon dates. The
period HS-1 refers to Heinrich Stadial 1. Red lines and confidence intervals are extinction date estimates
generated using the GRIWM method [see the supplementary materials section 2 (11)]. Last occurrences and
modeled extirpation times are reported in Table 1. (B) SPDs for all extinct taxa at La Brea. The overall
trend is long-term decline followed by a precipitous fall to 0 starting at ~13.3 ka. (C) Stacked SPDs for all
extinct North American megafauna south of the Laurentide Ice Sheet, excluding La Brea. Dates drawn from 
(9) and (14) and a literature review were subjected to strict quality control and divided into proboscideans 
and sloths (light gray) and all other extinct species (dark gray). The regional extirpation precedes the 
continental extinction but coincides with continental decline. 


regional paleoclimate, charcoal, and vegetation 
records and with continental-scale analyses 
of megafaunal extinction and human demo- 
graphic growth in North America. Taken to- 
gether, these data allow the first robust statistical 
modeling of extinction causality in Southern 
California. 


Extinction timing and dynamics 


We obtained accelerator mass spectrometry 
radiocarbon dates on 172 megafaunal individ- 
uals from La Brea (Fig. 1, A and B, and data S1) 
using a customized protocol for dating bone
collagen from asphaltic contexts (7, 13). These 
dates were then compared with a vetted com- 
pilation of published radiocarbon dates on 
North American megafauna south of Beringia 
[data S2; (11)]. The new dates presented here 
nearly double the number of reliable mega- 
faunal dates for non-Beringian North America, 
and establish a precise chronology of Pleis- 
tocene megafaunal extirpation in Southern 
California. All extinct mammals dated in this 
study have last occurrence dates older than 
13.00 ka, with a modeled extirpation time estimate
across all taxa of 13.07 to 12.89 ka [using
the Gaussian-Resampled Inverse-Weighted
McInerney (GRIWM) extinction estimator;
Table 1], placing the all-taxon extirpation almost
certainly before the onset of the Younger Dryas
(12.87 ± 0.03 ka) (14). Camels and ground sloths
disappeared earlier, with last occurrences of 
13.68 and 13.81 ka, respectively, whereas the dis- 
appearances of horses (13.03 ka), bison (13.14 ka), 
saber-toothed cats (13.02 ka), American lions 
(13.10 ka), and dire wolves (13.08 ka) are sta- 
tistically contemporaneous (Table 1). 
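
The gap between a last occurrence date and a modeled extirpation time can be illustrated with a much simpler estimator than GRIWM. The sketch below is not the GRIWM method used in this study: it swaps in the classical Strauss-Sadler range extension, which assumes finds are scattered uniformly through a taxon's true range, and borrows only the idea of redrawing each date from a Gaussian to propagate dating error. All names and ages are hypothetical.

```python
import random
import statistics

def range_extension_estimate(ages):
    """Strauss-Sadler point estimate of the true last occurrence.

    Ages are in years before present, so the youngest date is the minimum.
    The observed range is extended by range / (n - 1) past the youngest date.
    """
    n = len(ages)
    youngest, oldest = min(ages), max(ages)
    return youngest - (oldest - youngest) / (n - 1)

def resampled_estimate(dates, n_iter=2000, seed=0):
    """Propagate dating error by redrawing each age from Normal(median, sigma)."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n_iter):
        ages = [rng.gauss(m, s) for m, s in dates]
        draws.append(range_extension_estimate(ages))
    draws.sort()
    lo = draws[int(0.025 * n_iter)]
    hi = draws[int(0.975 * n_iter) - 1]
    return statistics.median(draws), (lo, hi)

# Hypothetical specimens (median age, 1-sigma), NOT data from the study.
dates = [(15200, 60), (14800, 50), (14300, 70),
         (13900, 60), (13400, 50), (13100, 40)]
point = range_extension_estimate([m for m, _ in dates])
median, (lo, hi) = resampled_estimate(dates)
```

In this toy series the youngest date is 13,100 years but the point estimate falls 420 years later, which shows why the modeled extirpation times reported here are younger than the corresponding last occurrences.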

The disappearance of megafaunal species at 
La Brea precedes the North American mega- 
faunal extinction by at least 1000 years (Fig. 1) 
but coincides with a precipitous decline in 
summed probability for North American mega- 
fauna at ~13.3 ka (Fig. 1C). Only 25 reliable, 
direct dates on North American megafauna 
fall within the Younger Dryas; more than half 
of these are on proboscideans, which were not 
dated in this study. Just six dates fall in the 
Holocene (after 11.7 ka), and all of these were 
obtained decades ago and should be redated 
(11). Of the seven extinct taxa examined in this
study, only one (Camelops) has younger dates 
from elsewhere in North America. For Smilodon, 
Panthera, Aenocyon, Equus, and Paramylodon, 
the new dates reported here are the youngest 
reliably dated occurrences for North America. 
Although our database does not include any 
chronologically younger specimens diagnosed 
as B. antiquus, some authors have argued that
this species survived into the mid-Holocene. 
Bison bison, the presumed chronospecies of 
B. antiquus, survives in inland areas of the con- 
tinent today (15). 

Occurrence rates of extinct taxa at La Brea 
decline gradually across the Bølling-Allerød,
before beginning a precipitous drop ~13.25 ka. 
Herbivore and carnivore histories differ across 
this interval; herbivore summed probability 
declines steadily from 14.1 ka to the extinction 
(Fig. 2), whereas carnivore summed probability 
fluctuates across the interval. These differences 
in entrapment distributions are statistically 
significant, as determined by a mark permutation
test [see Fig. 2 and section 2 of the supplementary
materials (11)]. The numbers of
extinct carnivores and herbivores dated are 
approximately equal, and the relative sizes of 
SPDs normalized, so the null expectation is that 
the entrapment rates should be equivalent. 
The observed bias toward carnivore entrap- 
ment just before the extinction may reflect in- 
creased predator reliance on asphalt-entrapped 
prey as large herbivores become scarce on 
the landscape. Coyotes do not mirror this late 
increase in entrapment frequency, possibly 
reflecting their ability to switch to smaller prey 
as large prey disappear and the competition 
with larger carnivores increases (16). Further 
scrutiny of the herbivore summed probabilities 
Fig. 2. Resampled summed probability distributions for the La Brea radiocarbon record. Lines are the
SPDs for all taxa or indicated groups; envelopes are 95% confidence intervals based on 1000 bootstrap
replicates. (A) All La Brea dates during the extinction interval: note that the terminal decline begins at
~13.2 ka. (B) Taxa split out by trophic level, with distributions for all extinct carnivores (N = 67, yellow) and
all extinct herbivores (N = 79, blue). Red stars indicate areas where the curves differ significantly, as
determined by a mark permutation test (11).


indicates a transition from a browser-dominated
to a grazer-dominated herbivore community
(11) (Fig. 3). Although the relative proportion
of bison remains stable throughout the
Bølling-Allerød, camels become less common
and horses more common over time (Fig. 3). 
This changing assemblage likely reflects chang- 
ing vegetation that favored grazers over brows- 
ers (17, 18). 
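
The logic of a permutation test on group labels ("marks") can be sketched in a few lines. The version below is a deliberately reduced stand-in for the mark permutation test cited above: instead of comparing whole SPD curves, it compares one scalar, the share of each group's dates that fall after a cutoff, and shuffles the carnivore and herbivore labels to see how often chance alone produces a difference at least as large. The cutoff, function names, and ages are hypothetical.

```python
import random

def late_share(ages, cutoff):
    """Fraction of dated individuals younger than the cutoff age (years BP)."""
    return sum(1 for a in ages if a < cutoff) / len(ages)

def permutation_test(group_a, group_b, cutoff, n_perm=5000, seed=1):
    """Two-sided p-value for the between-group difference in late_share."""
    rng = random.Random(seed)
    observed = late_share(group_a, cutoff) - late_share(group_b, cutoff)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # reassign the group labels at random
        diff = late_share(pooled[:n_a], cutoff) - late_share(pooled[n_a:], cutoff)
        if abs(diff) >= abs(observed) - 1e-12:
            extreme += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return observed, (extreme + 1) / (n_perm + 1)

# Hypothetical median ages (assumption for illustration), NOT data from the study.
carnivores = [13150, 13200, 13250, 13300, 13350, 13400, 13900, 14200, 14600, 15000]
herbivores = [13500, 13800, 14000, 14100, 14300, 14500, 14700, 14900, 15100, 15300]
observed, p_value = permutation_test(carnivores, herbivores, cutoff=13450)
```

With equal numbers in each group, as in the design described above, the null expectation is that the two shares are the same, so a small p-value indicates a late bias in one group that sampling alone is unlikely to produce.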

Coyotes (Canis latrans) are one of the few 
large mammal species to survive the extinction, 
so they serve as a “taphonomic control taxon” 
(19). Coyote occurrence rates drop coincident 
with those of La Brea megafauna, but occur- 
rences resume at 12.54 ka (Fig. 1) and this 
continues into the Holocene. The continued 
deposition of postextinction coyotes is prima 
facie evidence that the potential for large mam- 
mal entrapment remained at La Brea. The ab- 
sence of megafaunal deposition reflects their 
extirpation from the region rather than a 
taphonomic artifact. Neither taphonomic ef- 
fects nor other sampling biases can explain the 
differences in herbivore and carnivore demo- 
graphics noted above. However, these demo- 
graphic shifts and subsequent extirpations 
coincided with profound ecological transitions 
in Southern California. 


Climate and vegetation history 


Climate warming from Heinrich Stadial 
1 through the Bølling-Allerød in the Northern
Hemisphere is well documented (20), but late- 
Pleistocene climate dynamics vary regionally. 
In Southern California, temperature and precipitation
proxies indicate that the megafaunal
disappearance at La Brea follows a period of
significant regional warming and drying,
particularly winter drying (11, 21, 22) (Fig. 3).
Marine cores from the Santa Barbara Basin 
show warming of surface waters of 7 to 8°C 
from 17 to 15 ka (23). Mean annual air temper- 
ature proxy records from inland Lake Elsinore, 
~100 km southeast of La Brea, show a later 
warming of 5.6°C between 14.0 and 13.0 ka 
and an additional 4.4°C warming from 13.0 to 
11.8 ka (24) (Fig. 3C). Deglacial changes in 
hydroclimate toward drying at the onset of the 
Bølling-Allerød (14.7 ka) are evidenced at Lake
Elsinore by decreased sand input from runoff, 
a proxy for precipitation (Fig. 3D), and leaf 
wax hydrogen isotopic records (21, 24, 25). 
Additionally, a steep rise in salinity caused 
by increased evaporation relative to freshwater 
input into Lake Elsinore is indicated between 
13.7 and 13.2 ka (Fig. 3D) (24). 

Pollen records from Lake Elsinore track the 
floral response to this warming and drying 
trend. Juniper (Juniperus spp.), a coniferous
shrub or tree, declines with postglacial warm- 
ing and drying starting ~16 ka (Fig. 4) after 
being dominant throughout the Last Glacial 
Maximum. This decline has been documented 
in most California pollen records (22, 26-28). 
Oak (Quercus spp.) increases sharply at the 
beginning of the Bølling-Allerød, peaking in
abundance at 13.6 ka. Then, between 13.2 and 
12.9 ka, both oak and juniper decline, with almost
complete extirpation of juniper by 12.87 ka.
These taxa are then replaced by species with 
high fire resistance, including fire-tolerant 
pines (29) and grass and chaparral taxa (30). 
Although juniper is generally drought tolerant, 
it is vulnerable to soil water depletion from dry
winters followed by hot, dry summers (31) and
has low fire resistance (32). Today, severe juni- 
per mortality is occurring at low-elevation sites 
in the Southwestern United States due to warming
and drought (31).

To quantify floristic change during the ex- 
tinction interval, we performed a nonmetric 
multidimensional scaling (NMDS) (Fig. 3E) 
(11) analysis on published Lake Elsinore pollen
counts (29). NMDS axis 1 (NMDS1) scores 
reflect proportional changes of woodland tree 
(Juniperus, Pinus, and Quercus) versus chap- 
arral taxa (Asteraceae, Rhamnaceae/Rosaceae, 
Amaranthaceae, Eriogonum, Cyperaceae, and 
Poaceae). Lower NMDS1 values indicate more
short-statured, open, and xeric vegetation. The 
primary trend in pollen NMDS1 scores is a 
decline from 16.0 to 12.0 ka, indicating a dry- 
ing and opening of habitats that correlates 
with the pattern of declining herbivore abun- 
dance (Fig. 3). The proportional replacement 
of camels and sloths by horses in the herbivore 
community (Fig. 3B) is consistent with this 
transition to a more open and herbaceous veg- 
etational regime. Breakpoint analysis of the 
NMDS1 time series detects two periods of directional
change in vegetation occurring at
13.88 and 13.21 ka (Fig. 3E). These breakpoints 
separate three stable trends in the vegetation: 
(i) the gradual opening and drying of habitats 
from 16.0 to 13.88 ka; (ii) a brief stabilization 
of this trend between 13.88 and 13.21 ka; and 
(iii) a rapid and punctuated decline in tree 
cover 13.21 to 12.90 ka, which is followed by 
continued opening of landscapes. Given the 
severity and rapidity of this last interval, we in- 
terpret this NMDS shift as a period of drought, 
probably driven primarily by evaporative water 
loss in response to rapidly rising temperatures 
(Fig. 3C). The modest decline in the sand-based 
proxy for runoff indicates that decreased pre- 
cipitation may be contributing to increased 
aridity. A sharp increase in salinity concurrent 
with the substantial drop in NMDS values 
supports the inference of aridity during this 
time (Fig. 3D). Impacts to vegetation during 
this event include: (i) a tripling in the amount 
of grass pollen from 5 to 15%; (ii) a substantial 
decline in juniper and oak; (iii) an increase in 
herbaceous Asteraceae and Cyperaceae; and 
(iv) a slight increase in fire-adapted pine and 
chaparral shrubs (Rhamnaceae and Rosaceae). 
All of this coincides with the terminal decline 
and extirpation of the La Brea megafauna. 
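
The idea behind breakpoint analysis can be shown with a brute-force search: try every admissible split of the series, fit a separate least-squares line to each side, and keep the split with the smallest total squared error. This is a minimal single-breakpoint sketch, not the procedure described in the supplementary materials, which recovers two breakpoints; the series below is invented and only mimics a flat interval followed by a rapid decline.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept, squared error)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    sse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse

def best_breakpoint(xs, ys, min_points=3):
    """Index splitting the series into two fitted lines with minimum total error."""
    best_index, best_sse = None, float("inf")
    for i in range(min_points, len(xs) - min_points + 1):
        total = fit_line(xs[:i], ys[:i])[2] + fit_line(xs[i:], ys[i:])[2]
        if total < best_sse:
            best_index, best_sse = i, total
    return best_index, best_sse

# Hypothetical ordination scores (assumption), ages in ka ordered old to young.
ages = [14.0, 13.9, 13.8, 13.7, 13.6, 13.5, 13.4, 13.3, 13.2, 13.1, 13.0, 12.9]
scores = [0.50, 0.51, 0.49, 0.50, 0.51, 0.50, 0.49, 0.50, 0.40, 0.20, 0.00, -0.20]
index, sse = best_breakpoint(ages, scores)
break_age = ages[index]  # first sample of the second regime
```

Extending the search to two breakpoints means looping over pairs of split indices and summing three segment errors, at the cost of a quadratic number of fits.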


Fire activity 


We analyzed charcoal accumulation rates from 
the Lake Elsinore sediment core (LEDC10-1), 
Fig. 3. Summed probability distributions for subsets of the La Brea radiocarbon record and end-Pleistocene records
of biotic and abiotic change in Southern California. (A and B) SPDs for La Brea carnivores (N = 67) (A) and herbivores
(N = 79) (B). The line at 14.3 ka marks the bins for the pie chart and chi-square tests (11). Proxy climate data from
the Lake Elsinore core (LEDC10-1) including: mean annual air temperature (MAAT; red) derived from (24) (C) and sand
fraction (blue), a proxy for precipitation (25) and salinity (gray) (24) (D). The marked rise in salinity reflects
increasingly arid conditions driven more by rising temperatures than by precipitation decrease. (E) NMDS1 scores
depicting change in pollen composition from Lake Elsinore [see the materials and methods (11); data are from (29)].
Green vertical lines indicate shifts between regression regimes recovered from a subsidiary breakpoints analysis on
pollen NMDS [see section 4 of the supplementary materials and methods (11)]. A steady decline in NMDS is followed by
a brief stable period, then an abrupt floral transition toward a more open chaparral habitat containing more
drought-tolerant and fire-adapted species starting at ~13.3 ka. (F) Charcoal particles per square centimeter per year
from the Lake Elsinore core. The age model for the Lake Elsinore core, and associated error, can be found in fig. S1
and table S2.
Fig. 4. Record of floral change from 16 to 12 ka based on pollen counts from Lake Elsinore. (A and
B) Pie graphs showing vegetation abundance at 15.4 ka (A) and 13.0 ka (B) from Lake Elsinore (27).
(C) Light gray bars denote the HS-1 and Younger Dryas cooler periods; red bar shows the GRIWM estimate
for the La Brea megafaunal extirpation (Fig. 1). A shift from woodland tree dominance to xeric and fire-
adapted, grassy and chaparral taxa occurs concurrently with the La Brea regional extinction. Percentages are
of taxa representative of chaparral or woodland plant communities only; not all taxa present are shown.


measured as particles per square centimeter 
per year, from sediment spanning 16.0 to 
12.0 ka (11) (Fig. 3F). Charcoal accumulation
rates are low before the appearance of humans 
in the archeological record [including during 
times of previous drought (29, 33)] and in- 
crease modestly as humans arrive and the 
climate warms and dries beginning ~13.5 ka. 
However, ~13.2 ka, charcoal accumulation rates 
suddenly increase 30-fold. This interval of sig- 
nificantly heightened charcoal input lasts for 
~300 years and is attested in other regional
charcoal records (34), the Santa Barbara Basin 
at 13.0 ka (35), and continent wide between 
13.2 and 13.0 ka (36). This interval appears to 
be a tipping point for fire regimes in Southern 
California and across North America; after the 
initial spike, charcoal accumulation rates re- 
main elevated relative to earlier Pleistocene 
levels during the Younger Dryas and then in- 
crease again in the Holocene (36). Although 
the increase in charcoal input observed in the 
Southern California record is consistent with
observed changes in climate and vegetation, as 
well as with the disappearance of megaherbi- 
vores (37), its magnitude is unprecedented in 
the 33,000-year record (33). Its ignition source 
is open to question, but increased human im- 
pact on the ecosystem should be considered as 
a potential causal factor in the intense burning 
event ~13.2 ka. 


Human record 


Genetic and archeological data support a hu- 
man presence south of the North American ice 
sheets by 16 ka (38, 39) and possibly earlier. 
The megafauna-specialized Clovis technology 
is often invoked as the likely driver of North 
American megafaunal extinctions (40). How- 
ever, the emergence and geographic expansion 
of the Clovis complex (~13.05 to 12.75 ka) (41)
postdates the crash in megafaunal occurren- 
ces at La Brea at 13.25 ka, as well as the last 
occurrences of all dated taxa except Smilodon 
and Equus. Clovis also emerges after the North 
American megafauna begin to decline, but co- 
incides with the final megafaunal decline across 
the continent (Figs. 1 and 5) (41). The modeled 
extirpation time for La Brea megafauna (13.07 
to 12.89 ka) suggests an overlap with early Clovis, 
but Clovis continues for at least 140 years after 
the extirpation is complete. 

The oldest unequivocal evidence of human 
presence in California is a partial skeleton 
from Arlington Springs on Santa Rosa Island 
~12.89 ka (42) (Fig. 5A). No other archeological 
evidence of late-Pleistocene human presence 
or association between humans and mega- 
fauna has been documented in California. 
However, such sites would be rare and difficult 
to find because most deposits of this age are 
deeply buried or underwater (43). To better 
understand the potential impacts of humans 
on megafauna in Southern California, we con- 
structed an SPD for human occupation sites 
in North America (between southern Canada 
and northern Mexico) to serve as a proxy for 
human population in the region (11). Data
from the Canadian Archeological Radiocarbon 
Database (CARD) were subjected to strict data 
hygiene (11), and the probability distribution
was binned to control for variation in sam- 
pling intensity (Fig. 5A). Our model indicates 
that North American human population den- 
sity is low until ~13.2 ka, when it begins a sharp 
increase that continues into the Younger Dryas. 


State shift 


Together, these records of climate and vege- 
tation change, megafaunal extirpation, human 
demographic growth, and biomass burning 
document a profound shift in ecosystem struc- 
ture in Southern California near the end of the 
Bølling-Allerød. We analyzed the correlations
among these time series using principal compo- 
nents analysis and applied a Bayesian structural 
Fig. 5. Correlation among Southern California ecosystem variables. (A) Summed probability distribution of
human occupation in North America derived from the CARD database (black line). Blue bars present archaeological
data including age range for Clovis, earliest evidence of human occupation along the Pacific Coast, the
date of Arlington Springs Man (“AS”), and earliest evidence of human occupation west of the Rocky Mountains [see
materials and methods (11)]. (B) The fire record at Lake Elsinore as recorded by particles of charcoal per square
centimeter per year; series is a 200-year running mean. (C) PC1 scores for ecosystem variables through time.
Red regression lines are four regimes identified by breakpoint analysis [see section 4 of the supplementary
materials (11)]: the drying transition out of the HS-1 and initial human arrival; relative stability for much of the
Bølling-Allerød while humans were scarce (“Megafaunal Woodland”); an abrupt state shift from 13.3 to 13.0 ka
(“State Shift”); and the establishment of a new regime (“Chaparral”).


change analysis to identify significant shifts in 
principal component 1 (PC1) scores (11) (Fig. 5).
These analyses identify four distinct periods of
stable regression across the Bølling-Allerød,
most notably an ecosystem state shift from 
13.3 to 12.9 ka (Fig. 5). This state shift coincides 
with a 30-fold regional increase in charcoal 
accumulation rates and a shift in the floral 
community toward fire-adapted species. Dur- 
ing this interval, tree abundance plummets and 
megafaunal occurrence rates fall to zero. At 
the end of the state shift, Southern California 
enters a new ecological regime characterized 
by chaparral vegetation, intensified fire activity, 
and complete absence of Pleistocene megafauna. 
Many large mammals that are associated with 
fire-adapted ecosystems today (e.g., elk, moose, 
and grizzly bear) likely do not arrive in non- 
Beringian North America until the early Holo- 
cene (44-46). Others, such as deer and pumas, 
are present but rare at La Brea. 

Across this time interval, the coefficient for 
human population size is strongly negatively 
correlated with those for megafaunal populations
and glacial floral communities (Fig. 5C).
The trend over the Bølling-Allerød is toward
an increasingly fire-prone ecosystem. The un- 
precedented biomass burning observed dur- 
ing the state shift could have resulted from 
the confluence of extreme warming and dry- 
ing, fuel accumulation resulting from reduction 
of grazing herbivores (37), and new, anthro- 
pogenic ignition sources. Climate change thus 
may have facilitated the extinction by pushing 
the ecosystem toward a state where anthropo- 
genic activities could trigger widespread fires. 
In Southern California, the Pleistocene mega- 
fauna are already gone, and the transition toward 
a Holocene chaparral is well advanced, before 
the Younger Dryas begins. 


Causality 


Numerous studies have found correlations be- 
tween the timing of megafaunal disappear- 
ances and changes in climate (9), vegetation 
(2), human populations (5), and fire activity 
(37). We leveraged our highly resolved datasets
to quantify causality using vector autoregression
modeling with exogenous variables
(VARX) (11) (Fig. 6). Vector autoregressive
models use time-series data to identify whether,
and to what degree, past values of one variable
predict present values of others (47, 11).
Our time series included two exogenous var- 
iables: local precipitation inferred from per- 
cent sand from Lake Elsinore and NMDS1 
scores from Lake Elsinore pollen. The NMDS1 
scores are highly correlated with the MAAT 
estimates from Lake Elsinore (R² = 0.67; P =
1.18 × 10^[exponent illegible]) (11), making pollen our most densely
sampled proxy for terrestrial air temperature.
Endogenous variables included charcoal
accumulation rates from Lake Elsinore, the 
megafaunal probability distributions from 
La Brea, and the North American human prob- 
ability distribution. The average time step was 
43 years, and there were 57 steps across a 
2382-year span. We limited interpretation of 
the model to the first two time lags (~85 years), 
or two parameters per time series. We ran the 
VARX model both including and excluding 
our human SPDs for North America. The 
VARX model fit is much better with the hu- 
man distribution included [Akaike informa- 
tion criterion (AIC) of -19.71 versus -9.76 with 
and without humans, respectively; P < 0.001]. 
Given that an AIC difference of 2 is generally 
considered meaningful, this decrease of 10 is 
robust evidence that humans are a strong 
forcing factor in the model. 
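
The size of this AIC difference can be put on a more intuitive scale with the standard relative-likelihood formula, exp(-ΔAIC / 2). The short calculation below uses only the two AIC values and the 43-year mean step reported above; the variable names are illustrative.

```python
import math

aic_with_humans = -19.71     # reported in the text
aic_without_humans = -9.76   # reported in the text

# Lower AIC is better, so the difference is taken relative to the better model.
delta_aic = aic_without_humans - aic_with_humans

# Relative likelihood of the weaker (no-humans) model: exp(-delta / 2).
relative_likelihood = math.exp(-delta_aic / 2.0)

# Akaike weight of the better model in this two-model comparison.
weight_with_humans = 1.0 / (1.0 + relative_likelihood)

# Interpreted lag window: two lags at the reported 43-year mean step.
lag_window_years = 2 * 43
```

Under these numbers the model without humans has a relative likelihood of roughly 0.007, and two lags span 86 years at the rounded mean step, consistent with the ~85 years quoted above.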

The VARX model (Fig. 6) identified several 
significant time-lagged causal relationships. 
These include: (i) climate warming and aridi- 
fication predict an increase in charcoal input; 
(ii) declines in pollen NMDS1 also forecast an
increase in charcoal input; and (iii) human 
population growth predicts a decrease in her- 
bivore populations and, most strongly, an in- 
crease in charcoal input. Decrease in herbivore 
numbers at one lag predicts an increase in 
carnivore entrapment bias at La Brea. Some 
effects, such as the impact of vegetation change 
and fire activity on herbivore populations, likely 
occurred too rapidly to be captured by the 
43-year time lag. The model supports infer- 
ence of a potential positive feedback loop in 
which rising human populations cause en- 
hanced fire activity both indirectly by depress- 
ing herbivore numbers (resulting in increased 
fuel loads) and by increasing ignition. 
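
What it means for past values of one series to "predict" another can be illustrated with a two-variable, one-lag regression. The sketch below is far simpler than the VARX model fitted in this study and uses synthetic data: it regresses y at each step on its own previous value, then adds the previous value of x and checks how much the fit improves. The helper names, coefficients, and random series are all assumptions made for illustration; an exogenous driver would enter as one more column in the same design matrix.

```python
import random

def solve_normal_equations(rows, targets):
    """Least-squares coefficients via Gauss-Jordan elimination on (X'X) b = X'y."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, targets))] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [v - factor * p for v, p in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]

def sse(rows, targets, coefs):
    """Sum of squared residuals for a fitted linear model."""
    return sum((t - sum(c * v for c, v in zip(coefs, r))) ** 2
               for r, t in zip(rows, targets))

# Synthetic series (assumption): y depends on its own past and on lagged x.
rng = random.Random(42)
n = 200
x = [rng.gauss(0.0, 1.0) for _ in range(n)]
y = [0.0]
for t in range(1, n):
    y.append(0.5 * y[t - 1] + 0.8 * x[t - 1] + rng.gauss(0.0, 0.1))

targets = y[1:]
restricted = [[1.0, y[t - 1]] for t in range(1, n)]          # own past only
full = [[1.0, y[t - 1], x[t - 1]] for t in range(1, n)]      # plus lagged x
b_restricted = solve_normal_equations(restricted, targets)
b_full = solve_normal_equations(full, targets)
sse_restricted = sse(restricted, targets, b_restricted)
sse_full = sse(full, targets, b_full)
```

A large drop from `sse_restricted` to `sse_full`, together with a lagged-x coefficient near its true value, is the kind of evidence a vector autoregression summarizes across all variable pairs at once.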

Small populations of humans can have dis- 
proportionate impacts on landscapes through 
the use of fire (48); significant increases in re- 
gional fire activity after the arrival of humans 
have also been noted in Australia (49), New 
Zealand (50), Panama (51), and many other
regions worldwide (52). Today, changing fire 
regimes resulting from climate change and 
human activities are again driving some eco- 
systems toward tipping points (53). Not only 
can fire cause direct mortality of wildlife, but it 
can also alter the structure and function of 




Fig. 6. Summary time-series VARX model of ecosystem variables expressed as a structural equation model.
VARX 6,6 model incorporating probability distributions from La Brea herbivores (“Herb”) and carnivores (“Carn”),
charcoal from Lake Elsinore (“Char”), and our SPD for the North American human population (“Homo”). Exogenous
variables include Lake Elsinore %Sand, a proxy for precipitation, and NMDS of Lake Elsinore pollen (“Pollen”),
a proxy for onshore air temperature. Predictive relationships with effect size >0.4 are shown by red arrows
with width scaled to coefficient magnitude. For exogenous variables, all effects are shown. See section 5 of
the supplementary materials for values (11). Self-prediction is highly significant in most cases but is omitted
because the probability distributions are smoothed to 200 years and the average step length is 43 years.
For a further discussion of methods, see the supplementary materials (11).


vegetation, which affects the availability of key 
floral resources for animals, alters migration 
patterns, increases energetic costs of move- 
ment, and can put animals at higher risk of 
predation (53). Humans arriving in Southern 
California in the latest Pleistocene encount- 
ered a warming and increasingly arid climate 
coupled with ample flammable fuels. Anthro- 
pogenic hunting and burning could have pre- 
cipitated a state shift toward today’s chaparral 
ecosystem (Fig. 5). 

The debate over the cause of the Pleistocene 
megafaunal extinctions has raged for decades 
(5). This study demonstrates the necessity of 
moving beyond dichotomous statements about 
single extinction drivers and instead moving 
toward a more nuanced view of past extinc- 
tions, one that considers the interplay among 
biotic and abiotic causal factors. Our results 
also highlight the importance of considering 
extinction dynamics on ecologically relevant 
spatial, temporal, and taxonomic scales. Studies 
from the northeastern United States (2), the 
Pacific Northwest (52), and Alaska (54) have 
also found pre-Younger Dryas disappearances 
of megafauna coinciding with climate-driven 
environmental changes, and radiocarbon dates 
from the Southwestern United States indi- 
cate that megafauna may have persisted there 
well into the Younger Dryas (Fig. 1 and data 
S2). The results of our study also suggest that


individual taxa at La Brea responded differently 
to climate-driven vegetation shifts (Fig. 3).

A climate-human synergy such as the one 
we implicate in California’s megafaunal ex- 
tinctions may portend future ecological state 
shifts (55). Data from the National Oceanic 
and Atmospheric Administration (NOAA) show 
that Southern California has warmed >2°C over 
the past century, an order of magnitude faster 
than warming during the Bølling-Allerød.
Anthropogenic climate warming is increas- 
ing the risk of prolonged droughts and wild- 
fire activity in highly biodiverse Mediterranean 
regions worldwide (56). These events are pre- 
dicted to worsen in coming decades, affecting 
wildlife already experiencing population de- 
clines caused by other factors (57). As critical 
thresholds are reached in Mediterranean eco- 
systems, state shifts are likely to occur, as they 
did at the end of the Pleistocene. Some such 
transitions have already begun: in the western 
United States, wildfire-burned area has in- 
creased fourfold in two decades (58). More- 
over, postfire ecosystems are not recovering 
to preburned states, suggesting that critical 
thresholds for re-establishment have already 
been crossed (59). The conditions that led to 
the end-Pleistocene state shift in Southern Cal- 
ifornia are recurring today across the western 
United States and in numerous other ecosys- 
tems worldwide. Understanding the interplay 
of climatic and anthropogenic forcings in driving
the La Brea extinction event may be helpful
in mitigating future biodiversity loss in the face 
of similar pressures. 
HUMAN DEVELOPMENT


Yolk sac cell atlas reveals multiorgan functions 
during human early development 


Issac Goh† and Rachel A. Botting† et al.


INTRODUCTION: The yolk sac (YS) generates the 
first blood and immune cells and provides 
nutritional and metabolic support to the de- 
veloping embryo. Our current understanding 
of its functions derives from pivotal studies 
in model systems, and insights from human 
studies are limited. Single-cell genomics tech- 
nologies have facilitated the interrogation of 
human developmental tissues at unprecedented
resolution. Atlases of blood and immune
cells from multiple organs have been
greatly enhanced by focused, time-resolved
analyses of specific tissues.


RATIONALE: To characterize the functions of 
the human YS, we performed single-cell RNA 
sequencing (scRNA-seq) and cellular indexing 
of transcriptomes and epitopes by sequencing 
(CITE-seq) on the YS and paired embryonic 
liver. After integration with external datasets, 
our reference comprised 169,798 cells from 10 
samples spanning 4 to 8 postconception weeks.
Multiorgan functions of the human YS. We characterized functions of the developing human YS, combining
scRNA-seq and CITE-seq with 2D and 3D imaging techniques. Our findings revealed YS contributions to metabolic and
nutritional support and to early hematopoiesis. We characterized myeloid bias in early hematopoiesis, distinct myeloid
differentiation trajectories, evolutionary divergence in initial erythropoiesis, and YS contributions to developing
tissue macrophages. Met., metabolic; Coag., coagulation; Mac, macrophage. [Figure created with Biorender]
A repertoire of two-dimensional (2D) and 3D imaging
techniques provided spatial context and
validation. We compared the products of two 
hematopoietic induced pluripotent stem cell
(iPSC) culture protocols against our reference.


RESULTS: We determined that YS metabolic and 
nutritional support originates in the endoderm 
and that the endoderm produces coagulation 
proteins and hematopoietic growth factors [eryth- 
ropoietin (EPO) and thrombopoietin (THPO)]. 
Although metabolic and coagulation protein 
production was conserved among humans, mice, 
and rabbits, EPO and THPO production was ob- 
served in humans and rabbits only. 

We reconstructed trajectories from the YS 
hemogenic endothelium to early hematopoi- 
etic stem and progenitor cells (HSPCs). Using 
transcriptomic signatures of early and defini- 
tive hematopoiesis, we parsed YS HSPCs into 
myeloid-biased early HSPCs and lymphoid- 
and megakaryocyte-biased definitive HSPCs. 
Human embryonic liver remained macroscop- 
ically pale before CS14, when hematopoietic cells 
first emerge from the aorta-gonad-mesonephros 
(AGM) region. Tracking hemoglobin (Hb) sub- 
types led us to conclude that initial erythro- 
poiesis is YS restricted. By contrast, in mice, Hb 
subtypes suggested two waves of pre-AGM eryth- 
ropoiesis, including maturation in the macro- 
scopically red embryonic liver. 

Before CS14, monocytes were absent and macrophages
originated from HSPCs via a premacrophage
cell state. After CS14, monocytes
emerged and a second, monocyte-dependent
differentiation trajectory was reconstructed. A
rare subset of TREM2+ macrophages, with a
microglia-like transcriptomic signature, was
present after CS14. The iPSC system optimized
for macrophage production recapitulated the
two routes to macrophage differentiation but
did not generate the diversity of macrophages
(including TREM2+ macrophages) observed in
developing tissues.


CONCLUSION: Our study illuminates a previous- 
ly obscure phase of human development, where 
vital functions are delivered by the YS acting as a 
transient extraembryonic organ. Our compre- 
hensive single-cell atlas represents a valuable 
resource for studying the cellular differentiation 
pathways specific to early life and leveraging these 
for tissue engineering and cellular therapy. 
HUMAN DEVELOPMENT


Yolk sac cell atlas reveals multiorgan functions 
during human early development 
The extraembryonic yolk sac (YS) ensures delivery of nutritional support and oxygen to the developing
embryo but remains ill-defined in humans. We therefore assembled a comprehensive multiomic
reference of the human YS from 3 to 8 postconception weeks by integrating single-cell protein and gene
expression data. Beyond its recognized role as a site of hematopoiesis, we highlight roles in metabolism,
coagulation, vascular development, and hematopoietic regulation. We reconstructed the emergence
and decline of YS hematopoietic stem and progenitor cells from hemogenic endothelium and revealed a
YS-specific accelerated route to macrophage production that seeds developing organs. The multiorgan
functions of the YS are superseded as intraembryonic organs develop, effecting a multifaceted relay
of vital functions as pregnancy proceeds.


The primary human yolk sac (YS) derives
from the hypoblast at the time of embryo
implantation [Carnegie stage 4 (CS4);
~1 postconception week (PCW)] (1, 2).
The secondary YS supersedes the primary
structure at around CS6 (~2.5 PCW) and
persists until CS23 (~8 PCW) (1, 2). The secondary
YS has three tissue compartments: the
mesothelium (an epithelial layer interfacing 
the amniotic fluid), the mesoderm (including 
endothelial cells, blood cells, and smooth muscle),
and the endoderm (an inner layer interfacing
the vitelline fluid-filled YS cavity) (1). The
functions of the YS in nutrient uptake, trans- 
port, and metabolism are phylogenetically con- 
served (2). 

Hematopoiesis originates in the YS of mam- 
mals, birds, and some ray-finned fishes (3). The 
first wave of mouse YS hematopoiesis yields pri- 
mitive erythroid cells, macrophages, and mega- 
karyocytes (MKs) from embryonic day 7.5 (E7.5) 
(3, 4). After circulation begins, a second wave 
of erythromyeloid and lymphomyeloid pro- 
genitors arise in the YS and supply the embryo 
(5). Finally, definitive hematopoietic stem cells 
arise in the aorta-gonad-mesonephros (AGM) 
region of the dorsal aorta and seed the fetal 
liver (6). Limited evidence suggests that the YS 
also provides the first blood cells during human
development. Primitive erythroblasts expressing
embryonic globin genes, surrounded by
endothelium, emerge in the YS at CS6 (~2.5 PCW) 
(7, 8). Hematopoietic progenitors and macro- 
phages are detectable at CS11 (~4 PCW) (9), with 
MKs, monocytes, mast cells, and innate lym- 
phocytes also reported (9, 10). Long-term multi- 
lineage repopulating (definitive) hematopoietic 
stem and progenitor cells (HSPCs) originate in 
the AGM at CS14 (~5 PCW) (11). Equivalent cells 
are subsequently found in the YS at CS16 and 
the liver from CS17 (11, 12). 

In this study, we report a time-resolved atlas 
of the human YS, combining single-cell pro- 
tein and gene expression with imaging. This 
provides a comprehensive depiction of the 
metabolic and hematopoietic functions of the 
human YS as well as a benchmark for in vitro 
culture systems aiming to recapitulate human 
early development. 


A single-cell atlas of human YS 


We performed droplet-based single-cell RNA sequencing
(scRNA-seq) to profile the human YS
and integrated these data with external datasets to yield
169,798 high-quality cells from 10 samples spanning
4 to 8 PCW (CS10 to CS23), which can be
interrogated on our interactive web portal
[https://developmental.cellatlas.io/yolk-sac (13)] (Fig. 1,
A to C; fig. S1, A to C; and data S1 to S18). Graph-based
Leiden clustering yielded 39 cell types
grouped into 15 broad categories, including
hematopoietic cells, endoderm, mesoderm, and
mesothelium. Key marker genes were validated
by plate-based scRNA-seq (Smart-seq2) (Fig. 1,
C and D; fig. S1, B to F; and data S3 to S5, S8,
S17, and S18). We used the term “HSPC” for cells
collectively on the basis of their expression of a
core HSPC signature (e.g., CD34, SPINK2, and
HLF) without implying long-term repopulating
capacity or multilineage potential. With
comparison datasets, unless otherwise specified,
we adopted published annotations (data
S6 and S7). Surface protein expression from
cellular indexing of transcriptomes and epitopes
by sequencing (CITE-seq) of n = 2 YS cell
suspensions (fig. S1, G and H) identified combinatorial
antigens for cell purification and
functional characterization (fig. S2A and data
S9, S19, and S20). We generated matched embryonic
liver scRNA-seq and CITE-seq data
(fig. S2, B to F; fig. S3, A to C; and data S5, S10,
S21, and S22), which confirmed the presence
of discrete B cell progenitor stages only in the
liver (fig. S3C and data S12). Around half of the
YS lymphoid cells were innate lymphoid progenitors,
which terminated in natural killer
(NK) and innate lymphoid cell (ILC) precursor
states on force-directed graph (FDG) visualization
(fig. S3D). A small population of cells
was referred to as “lymphoid B lineage” because
of their expression of CD19, CD79B, and
IGLL1. These cells did not express the typical
B1 markers CD5, CD27, or CCR10, however. Given
Fig. 1. A single-cell atlas of the
human YS. (A) Schematic of
experimental outline. (B) Summary
of data included in analyses.
Squares represent new data, and
triangles represent published data:
YS (10, 12, 48, 63), AGM (12), liver
(10), fetal BM (34), fetal brain (55),
fetal skin (48), fetal kidney (74),
fetal gonads (49), mouse (75), and
iPSC (12, 20). Color indicates
assay used (data S6). (C) UMAP
visualization of YS scRNA-seq data
(n = 10; k = 169,798). Colors
represent broad cell states. Mac,
macrophage; MEM, megakaryocyte-erythroid-mast
cell lineage; pre.,
precursor; prog., progenitors.
(D) (Left) Dot plot showing the mean
expression (color) and proportion of
cells expressing genes (dot size)
of broad cell states in YS scRNA-seq
data. (Right) Equivalent protein
expression (color) and proportion of
cells expressing proteins (dot size)
from YS CITE-seq data (n = 2;
k = 3578). Equivalent gene names
are in parentheses. One asterisk
indicates genes validated by RNAscope,
and two asterisks indicate
proteins validated by IHC and/or
immunofluorescence (IF) (data S4).
Data are variance-scaled and min-max-standardized.
(E) (Left)
Light-sheet fluorescence microscopy
of CD34+ and LYVE1+ vascular
structures in YS (representative
~6.9 PCW sample). Scale bar,
500 μm (movie S1). (Right)
Immunofluorescence of an ~8-PCW
YS highlighting endoderm (ASGR1;
red) and endothelium (CD34; yellow)
costained with DAPI (cyan). Scale
bar, 100 μm (data S23). (F) RNAscope
of YS (representative 8-PCW
sample). (Left) Endoderm (SPINK1;
yellow), smooth muscle (ACTA2;
red), AEC (IL33; blue), and macrophages
(C1QA; magenta). Scale bar,
200 μm. (Right) DCs (CD1C; yellow
box) and macrophages (C1QA;
magenta box). Scale bar, 50 μm.
Individual channels are shown in
fig. S4A. (G) (Left) Bar graph
showing the proportional representation of cell states in YS scRNA-seq data by gestational age. (Right) Milo beeswarm plot of YS scRNA-seq neighborhood
differential abundance across time. Blue and red neighborhoods are significantly enriched earlier and later in gestation, respectively. Color intensity denotes degree
of significance (data S24).


the absence of distinct B cell progenitor stages
and their later emergence (>5 PCW), these
may constitute migratory B cells of fetal liver
origin (fig. S3, D and E).
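
The dot plots in Fig. 1D summarize each gene by its mean expression and by the proportion of cells expressing it, after variance scaling and min-max standardization. The following is a minimal Python sketch of that kind of summary for a single gene. The function name, the toy expression values, and the choice of "expression > 0" as the definition of an expressing cell are assumptions made for illustration; they are not taken from the study's analysis code.

```python
from statistics import mean, pstdev


def dot_plot_stats(expr_by_state):
    """Summarize ONE gene across cell states (hypothetical helper).

    expr_by_state maps a cell-state label to a list of per-cell
    expression values for that gene.
    """
    # Variance-scale: divide by the standard deviation across all cells.
    all_values = [v for values in expr_by_state.values() for v in values]
    sd = pstdev(all_values) or 1.0

    stats = {}
    for state, values in expr_by_state.items():
        stats[state] = {
            # Dot color: mean of the variance-scaled expression.
            "mean": mean([v / sd for v in values]),
            # Dot size: fraction of cells with nonzero expression (assumed cutoff).
            "fraction": sum(v > 0 for v in values) / len(values),
        }

    # Min-max standardize the per-state means to the range 0 to 1.
    lowest = min(s["mean"] for s in stats.values())
    highest = max(s["mean"] for s in stats.values())
    span = (highest - lowest) or 1.0
    for s in stats.values():
        s["scaled_mean"] = (s["mean"] - lowest) / span
    return stats


# Toy values only; not measurements from the atlas.
toy = {
    "Endoderm": [5.0, 4.0, 6.0, 0.0],
    "Macrophage": [0.0, 0.0, 1.0, 0.0],
    "HSPC": [0.0, 2.0, 2.0, 0.0],
}
result = dot_plot_stats(toy)
```

After min-max standardization the state with the highest mean maps to 1 and the lowest to 0, so the color scale is comparable between genes with very different absolute expression.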


Three-dimensional visualization of the YS by
light-sheet microscopy marked the CD34hiLYVE1lo
vitelline artery and CD34loLYVE1hi vitelline
vein contiguous with a branching network of
CD34loLYVE1hi vessels (Fig. 1E, fig. S3F, data S23,
and movies S1 and S2). The CD34loLYVE1hiIL33+
vessels were situated within the mesoderm,
a distinct layer beneath the ASGR1+SPINK1+
endoderm (Fig. 1, E and F; fig. S3, F to I; and
fig. S4, A and B). ACTA2+ smooth muscle cells
formed a sublayer between the mesoderm and
endoderm (Fig. 1F and fig. S4, A and B). Macrophages
(C1QA+CD1C+/-) and a small number
of dendritic cells (DCs) (C1QA+/-CD1C+) were
identified within the mesoderm (Fig. 1F and
fig. S4A).

The most prevalent hematopoietic cell types 
in the early YS (CS10; ~4 PCW) were HSPCs, 
erythroid cells, macrophages, and MKs. Both 
HSPCs and MKs proportionately diminished 
thereafter, whereas erythroid cells and macrophages
were sustained. DCs and TREM2+ macrophages
did not emerge until >6 PCW (Fig.
1G and data S4). The ratio of hematopoietic to 
nonhematopoietic cells was around 3:1 in the 
early YS (CS10; ~4 PCW), with endoderm rela- 
tively abundant (Fig. 1G). The ratio approached 
1:3 in the late YS (CS22 to CS23; ~8 PCW) be- 
cause of the expansion of fibroblasts (Fig. 1G). 
The transcriptional profile of MKs was con- 
sistent across gestation, but both erythroid cells 
and macrophages had early and late gestation- 
specific molecular states, suggesting dual waves 
of production (Fig. 1G, fig. S4C, and data S24). 


Multiorgan functions of YS 


YS endoderm coexpressing APOA1/2, APOC3,
and TTR (similar to embryonic or fetal hepatocytes)
was present from gastrulation at ~2 to
3 PCW (14) (Fig. 2A and data S3, S7, and S21). The
YS endoderm expressed higher levels of serine
protease 3 (PRSS3), glutathione S-transferase
alpha 2 (GSTA2), and multifunctional protein
galectin 3 (LGALS3) compared with embryonic
liver hepatocytes, whereas hepatocytes expressed
a more extensive repertoire of detoxification
enzymes, including alcohol and aldehyde dehydrogenases
and cytochrome P450 enzymes
(fig. S4D and data S3, S7, and S21). Both cell
states shared gene modules implicated in coagulation
and lipid and glucose metabolism
(Fig. 2B and data S25), which were also conserved
in mouse and rabbit extraembryonic
endoderm (fig. S4, E and F, and data S25).
The expression of transport proteins (alpha-fetoprotein
and albumin), a protease inhibitor
(alpha-1-antitrypsin), erythropoietin (EPO), and
coagulation proteins (thrombin, prothrombin,
and fibrin) was validated in human YS endoderm
and embryonic liver hepatocytes (Fig.
2C and fig. S4G).

From the earliest time points, the YS endoderm
expressed genes for anticoagulant proteins
antithrombin III (SERPINC1) and protein
S (PROS1) and components of the tissue factor-activated
extrinsic coagulation pathway: thrombin
(F2), factor VII (F7), and factor X (F10) (Fig. 2D
and data S25), confirmed at the protein level for
thrombin (Fig. 2C). Intrinsic pathway factors
VIII, IX, XI, and XII (F8, F9, F11, and F12) were
minimally expressed in the YS but were expressed
by embryonic liver hepatocytes (Fig. 2D).
Tissue factor, antithrombin III, and fibrinogen
subunits were also expressed in mouse extraembryonic
endoderm and rabbit YS endoderm
(fig. S4E). Embryonic lethality of homozygous-null
mice lacking prothrombin, thrombin, and
coagulation factor V before liver synthetic function
(i.e., at E9 to E12) implies functional relevance
of YS expression (Fig. 2D and data S25)
(15, 16), whereby coagulant and anticoagulant
pathways develop in parallel to balance
hemostasis.

YS endoderm cells expressed EPO and thrombopoietin
(THPO), which are critical for erythropoiesis
and megakaryopoiesis, respectively (Fig. 2, C and
D; fig. S4H; and data S25 and S26). In mouse
development, EPO is produced by the fetal liver
and is only essential for definitive and the later
stages of primitive erythropoiesis, with Epo/Epor-knockout
mice dying at around E13 (17). An
EPO source before liver development is therefore
likely not needed in mice. Accordingly, EPO
has not been found in the mouse YS (18) (fig.
S4E). In parallel to the human YS, rabbit YS endoderm
also produced EPO at gestational stages
preceding liver development (fig. S4E). We compiled
a 12-organ integrated human fetal atlas
spanning 3 to 19 PCW (k = 3.12 million cells,
n = 150 donors; fig. S8, G and H, and data S6
and S7) and observed that EPO and THPO
production were restricted to the YS and liver
(fig. S4H), specifically to YS endoderm and liver
hepatocytes (fig. S4I). Differentially expressed
genes (DEGs) between the early and late YS endoderm
revealed active retinoic acid and lipid
metabolic processes until ~7 PCW, after which
genes associated with cell stress and death were
expressed (Fig. 2E and data S26). A decline in
the proportion of YS endoderm cells producing
EPO was compensated by the onset of EPO
production by hepatocytes at 7 PCW (fig. S4J).
Thus, the human YS plays a critical role supporting
hematopoiesis, metabolism, coagulation,
and erythroid cell mass regulation before these
functions are taken over by the embryonic or
fetal liver and then, ultimately, by the adult liver
(metabolism and coagulation), bone marrow
(BM) (hematopoiesis), and kidney (erythroid
cell mass regulation) (Fig. 2F).


Early versus definitive hematopoiesis in the 
YS and liver 


Human YS hematopoietic progenitors spanned
two groups: HSPCs characterized by SPINK2,
CYTL1, and HOXB9 expression and cycling
HSPCs characterized by cell cycle-associated
genes, such as MKI67 and TOP2A (fig. S5A and
data S17). Using markers recently associated
with early (DDIT4, SLC2A3, RGS16, and LIN28A)
and definitive (KIT, ITGA4, CD74, and PROCR)
HSPCs (12), we identified early and definitive
fractions within both HSPCs and cycling HSPCs
(Fig. 3, A to C). Early and definitive HSPCs expressed
canonical HSPC genes, such as SPINK2,
HOPX, and HLF (Fig. 3A and data S17), but
diverged in expression of genes involved in multiple
processes, such as enzymes (GAD1), growth
factors (FGF23), adhesion molecules (SELL),
and patterning genes (HOXA7) (fig. S5C and
data S17). By logistic regression (LR), YS HSPCs
had a high median probability of class correspondence
to liver hematopoietic stem cells
(HSCs), but this probability was higher for
definitive than for early HSPCs (fig. S5B). YS
cycling HSPCs had a bipartite probability distribution
between liver multipotential progenitor
(MPP) and common myeloid progenitor
(CMP), with the definitive cycling HSPC more
MPP-like compared with the early version. Differential
protein expression in YS CITE-seq data
indicated that CD122, CD194 (CCR4), and CD357
mark early HSPCs, whereas CD44, CD48, CD93,
and CD197 (CCR7) mark definitive HSPCs (fig.
S5D and data S27), in keeping with the reported
use of CD34 and CD44 to segregate early- and
definitive-type HSPCs by fluorescence-activated
cell sorting (FACS) (19). We confirmed that an
induced pluripotent stem cell (iPSC)-derived
culture system reported to generate definitive
HSPCs did express RNA markers characteristic
of definitive HSPCs (12), but an iPSC-derived
culture system optimized for macrophage production
(20) did not (Fig. 3A).
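
To illustrate what a "median probability of class correspondence" from logistic regression means in practice, here is a minimal Python sketch: a binary classifier is trained on labeled reference cells (standing in for liver HSCs versus other liver progenitors), query cells (standing in for YS HSPCs) are then scored, and the median predicted probability is reported. The two toy features, all numerical values, the plain gradient-descent training loop, and every function name are assumptions for illustration. The study's actual model, feature set, and software are not described in this section.

```python
import math
from statistics import median


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, learning_rate=0.1, epochs=2000):
    """Fit a binary logistic regression by batch gradient descent."""
    n_samples, n_features = len(X), len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for features, label in zip(X, y):
            z = sum(w * x for w, x in zip(weights, features)) + bias
            error = sigmoid(z) - label
            for j, x in enumerate(features):
                grad_w[j] += error * x
            grad_b += error
        weights = [w - learning_rate * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / n_samples
    return weights, bias


def predict_proba(model, X):
    weights, bias = model
    return [sigmoid(sum(w * x for w, x in zip(weights, f)) + bias) for f in X]


# Toy reference: two hypothetical features per cell
# (an "HSC-like" score and a "CMP-like" score); label 1 = HSC, 0 = other.
reference_X = [[2.0, 0.1], [1.8, 0.3], [2.2, 0.0], [0.2, 1.9], [0.1, 2.1], [0.3, 1.7]]
reference_y = [1, 1, 1, 0, 0, 0]
model = train_logistic(reference_X, reference_y)

# Toy query populations standing in for two YS progenitor groups.
hsc_like_query = [[1.9, 0.2], [1.5, 0.5], [2.1, 0.1]]
cmp_like_query = [[0.2, 1.8], [0.4, 1.6], [0.1, 2.0]]
median_hsc_like = median(predict_proba(model, hsc_like_query))
median_cmp_like = median(predict_proba(model, cmp_like_query))
```

Reporting the median rather than a single predicted label keeps the comparison robust to a few atypical cells, which matters when two populations (for example, early and definitive HSPCs) both resemble the reference class but to different degrees.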

To assess cross-tissue HSPC heterogeneity, we
integrated HSPCs across hematopoietic organs
(Fig. 3C, fig. S5E, and data S6 and S7). By kernel
density estimation (KDE) score on integrated
uniform manifold approximation (UMAP) embeddings,
YS definitive HSPCs qualitatively colocalized
with definitive HSPCs from age-matched
liver (Fig. 3C and fig. S5E). From exclusively
early HSPCs at ~3 PCW, we observed rapid accumulation
of definitive HSPCs after AGM development
at CS14 (~5 PCW), likely accounting
for the increase in the YS HSPC/progenitor fraction
at 8 PCW (Fig. 1G, Fig. 3B, and fig. S5F).
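
The colocalization argument above rests on estimating where one population is dense in a shared two-dimensional embedding and then asking whether another population sits in the same region. A minimal Python sketch of a Gaussian KDE with z-scoring follows. The coordinates, the bandwidth, and the function names are illustrative assumptions; the study's actual KDE implementation and parameters are not given in this section.

```python
import math
from statistics import mean, pstdev


def gaussian_kde_2d(points, query, bandwidth=1.0):
    """Evaluate an isotropic 2D Gaussian KDE built from `points` at `query`."""
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2 * len(points))
    densities = []
    for qx, qy in query:
        total = sum(
            math.exp(-((qx - px) ** 2 + (qy - py) ** 2) / (2.0 * bandwidth ** 2))
            for px, py in points
        )
        densities.append(norm * total)
    return densities


def zscore(values):
    """Center and scale values so populations can share one color scale."""
    mu, sd = mean(values), pstdev(values)
    return [(v - mu) / sd if sd else 0.0 for v in values]


# Toy embedding coordinates (assumed): YS definitive HSPCs cluster near (5, 5).
ys_definitive = [(5.0, 5.0), (5.2, 4.8), (4.9, 5.1), (5.1, 5.3)]
# Liver cells: the first two lie in the same region, the last two do not.
liver_cells = [(5.0, 5.1), (4.8, 4.9), (0.0, 0.2), (0.3, -0.1)]

density_at_liver = gaussian_kde_2d(ys_definitive, liver_cells, bandwidth=0.5)
scores = zscore(density_at_liver)
```

Liver cells that fall inside the dense YS region receive positive z-scores and distant cells receive negative ones, which is the qualitative pattern a density plot on the integrated embedding is meant to show.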

Next, we examined the transition from YS to
liver hematopoiesis. Before AGM, the human
embryonic liver is macroscopically pale, which
suggests that erythropoiesis predominantly occurs
in the YS (Fig. 3D). We tracked the proportional
representation of hemoglobin (Hb)
subtypes over time as a proxy for YS versus
embryonic liver contributions. HBZ and HBE1
(genes for Hb Gower 1) were restricted to YS
erythroblasts, whereas HBG1 (which forms
fetal Hb/HbF in combination with an alpha
chain) was expressed in fetal liver erythroblasts
(21-25) (Fig. 3E and fig. S5H). The sustained
HBZ production in YS for several days before
liver bud formation (4 PCW) was consistent
with a scenario where the YS supports initial
erythropoiesis. At 7 PCW, the embryonic liver
contained both HBZ and HBG1 (Fig. 3E), in
keeping with previous studies of Hb switching
(8). By 8 PCW, embryonic liver erythroblasts
expressed HBZ-repressors and were HBG1-dominant,
as we have previously shown (10).
By contrast, the mouse liver was macroscopically
red before AGM maturation (Fig. 3D).
Tracking Hb subtype usage in the mouse, we
noted two waves of pre-AGM erythropoiesis:
an initial wave with Hbb-y and Hba-x transitioning
to Hba-a1/2, and a second wave mirrored
in both the YS and the torso or liver
Fig. 2. Multiorgan functions of YS.
(C) (Columns 1 to 3) IHC staining of
alpha fetoprotein (AFP), albumin (ALB),
and alpha-1 antitrypsin (SERPINA1) in
8-PCW YS (top), EL (middle), and adult
liver (bottom). Representative images
of n = 5 YS (4 to 8 PCW), n = 3 ELs (7 to
8 PCW), and n = 3 adult liver samples.
(Columns 4 and 5) IHC staining of EPO
and thrombin (F2) in 7-PCW YS (top),
7-PCW EL (middle), and healthy adult
liver (bottom). Representative images
from n = 3 samples per tissue: YS (4 to
7 PCW) and EL (7 to 12 PCW). Proteins
are brown, and nuclei are blue. (Column
6) Martius scarlet blue (MSB)-stained
8-PCW EL (representative of n = 3) and
4-PCW YS (representative of n = 3).
Nuclei are gray, erythroid is yellow, fibrin
is red, and connective tissue is blue
(data S23). Scale bars, 100 μm. (D) Dot
plot showing the mean expression
(color) and proportion of cells in YS
endoderm; embryonic, fetal, and adult
liver hepatocytes; and stromal cells
from fetal kidneys. Brackets indicate
enriched GO annotations. Green
ellipses denote genes with prenatal
phenotypes in homozygous-null mice.
Solid and hollow green outlines denote
phenotype onset before and after
fetal liver (FL) function, respectively, as
per fig. S4E (data S25). (E) Dot plot
showing the mean expression (color)
and proportion of cells expressing
Milo-derived DEGs across gestation
(dot size) in YS endoderm (data S24).
Genes are grouped by function.
(F) Schematic of the relative contributions
of YS (orange), liver (blue), and BM (purple) to hematopoiesis, coagulation factor, and EPO production in the first trimester of human development.


(Hbb-bt1 and Hbb-bs) (fig. S5, G to I). Thus,
there is a species-specific difference in YS
erythropoiesis and a rapid shift in Hb usage
after AGM development in humans.
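
Tracking the proportional representation of Hb subtypes over time amounts to counting, at each gestational age, what share of erythroid cells is characterized by each globin gene. A minimal Python sketch follows. It assumes, purely for illustration, that each cell is assigned to its most highly expressed globin; the criterion used in the study for a cell being "enriched" in a globin is not specified in this section, and the toy values are invented.

```python
from collections import Counter, defaultdict


def dominant_globin_proportions(cells):
    """cells: iterable of (age_pcw, {globin_gene: expression}) pairs.

    Returns {age: {globin: proportion of cells dominated by that globin}}.
    """
    counts = defaultdict(Counter)
    for age, globins in cells:
        # Assumption: a cell is assigned to its most highly expressed globin.
        dominant = max(globins, key=globins.get)
        counts[age][dominant] += 1
    return {
        age: {gene: n / sum(counter.values()) for gene, n in counter.items()}
        for age, counter in counts.items()
    }


# Toy erythroid cells (invented values): HBZ-dominant early, HBG1-dominant late.
toy_cells = [
    (4, {"HBZ": 9.0, "HBE1": 7.0, "HBG1": 0.5}),
    (4, {"HBZ": 8.0, "HBE1": 6.5, "HBG1": 0.2}),
    (8, {"HBZ": 1.0, "HBE1": 0.5, "HBG1": 9.0}),
    (8, {"HBZ": 0.8, "HBE1": 0.4, "HBG1": 8.5}),
    (8, {"HBZ": 6.0, "HBE1": 2.0, "HBG1": 3.0}),
    (8, {"HBZ": 0.5, "HBE1": 0.3, "HBG1": 7.0}),
]
proportions = dominant_globin_proportions(toy_cells)
```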

We examined data from human gastrulation
(~2 to 3 PCW) and CS10 to CS11 (~4 PCW),
time points before AGM-HSPC formation, to
explore the differentiation potential of early
HSPCs. At gastrulation, the YS hematopoietic
landscape had a tripartite differentiation structure,
with erythroid, MK, and myeloid differentiation
(Fig. 3F). This structure was also
observed in the mouse YS (fig. S6, A and B, and
data S5 and S11). Differential fate-prediction
analysis demonstrated that early HSPCs pre-AGM
at CS10 to CS11 (~4 PCW) were myeloid-biased,
consistent with previous observations
(9). However, the abundance of differentiating
erythroid and MK cells at CS10 to CS11 suggested
that an earlier wave of erythroid and MK
production had occurred (Fig. 3G and fig. S6C).
Post-AGM, the model predicted that remaining
early HSPCs were erythroid- and MK-biased,
whereas definitive HSPCs were lymphoid- and
MK-biased (Fig. 3G). This was in keeping
with the first appearance of YS lymphoid cells
(ILC progenitors, NK cells, and B lineage cells)
Fig. 3. Early versus definitive hematopoiesis in the YS and liver. (A) Dot
plot showing mean expression (color) and proportion of cells expressing selected
HSPC genes (dot size) in HSPCs from YS [main and gastrulation (14)], liver
[EL and FL (10)], AGM (76), BM (34), and iPSC cultures [iPSC (20) and definitive
iPSC (12)]. (B) Bar chart showing proportion of early (yellow) to definitive
HSPCs (green) in the YS scRNA-seq data grouped by gestational age.
(C) Density plots showing YS HSPC (top) and cycling HSPC (bottom) with
early (left) and definitive signatures (right) in an integrated landscape as per (A).
Color indicates population z-scored KDE (data S5). Tissue contributions are
shown in fig. S5E. (D) Representative image of whole ~4-PCW or CS12 human
(top; n = 4) and ~E10.5 or CS12 mouse embryo (bottom). Scale bars, 1 mm.
(E) Line graphs showing change in erythroid cell proportion (y axis) enriched in
globin gene expression across gestational age. Colors indicate scRNA-seq
dataset: pink, human YS; red, matched EL. Shape size indicates cell count, scale
indicates representative counts, and no shape indicates a count <500.
Globins are grouped by roles in early or definitive hematopoiesis and repression.
(F) FDG of hematopoietic cell states in the YS scRNA-seq data (n = 8,
k = 98,738; dots) integrated with human gastrulation (14) scRNA-seq data (n = 1,
k = 91; triangles) (left) and equivalent cell states in the mouse gastrulation
scRNA-seq dataset (75) (n = 28, k = 4717; dots) (right). Colors represent cell
states, and clouds mark lineages. (G) Radial plots showing lineage transition
probabilities between pre-AGM (CS10 to CS11; left) and post-AGM (>CS14; right) YS
early and definitive HSPCs. Color indicates population z-scored KDE. Density position
indicates respective lineage priming probability between macrophage, lymphoid

post-CS14 (Fig. 1G). Differential fate-prediction 
analyses suggested that iPSC-derived HSPCs 
were embryonic erythroid-, myeloid-, and MK- 
biased, whereas definitive iPSC-derived HSPCs 
were lymphoid-, MK-, erythroid-, and myeloid- 
primed, consistent with the predicted lineage 
potential of their in vivo early and definitive 
counterparts (Fig. 3H and fig. S6D). 


The life span of YS HSPCs 


HSPCs arise from hemogenic endothelium (HE)
in the aorta, YS, BM, placenta, and embryonic
head in mice (26-30). In the human AGM, definitive
HSPCs emerge from IL33+ALDH1A1+ arterial
endothelial cells (AECs) via KCNK17+ALDH1A1+
HE (31). When we dissected YS endothelial cell (EC) states
in greater detail, the broad category of PLVAP+
ECs included AECs and HE, whereas LYVE1+
ECs encompassed sinusoidal, immature, and
Von Willebrand factor (VWF)-expressing ECs
(Fig. 4A; fig. S7, A and B; and data S4 and S5).
HE was a transient feature of early YS (Fig. 4A).
Along inferred trajectories, YS HSPCs appeared
to arise from AECs via HE as in AGM (12),
sequentially up-regulating expected genes such as
ALDH1A1 (32) (Fig. 4B). The same EC intermediate
states and transition points were identified
in both iPSC culture systems (Fig. 4B and
fig. S7C). In keeping with their more recent
endothelial origin, we found that YS HSPCs and
AGM HSPCs, but not embryonic liver or fetal
BM HSPCs, retained an EC gene signature characterized
by the expression of KDR, CDH5, ESAM,
and PLVAP (Fig. 4C).

Receptor-ligand interactions capable of supporting
HSPC expansion and maintenance
in YS were predicted using CellPhoneDB (33)
and compared with predictions in fetal BM
(34). We identified YS ECs, fibroblasts, smooth
muscle cells, and endoderm as likely interacting
partners (Fig. 4, D to F, and data S28). YS
ECs, like fetal BM ECs, were predicted to maintain
and support the HSPC pool (35) through
the production of stem cell factor (KITLG) and
NOTCH1/2, although the repertoire of NOTCH
ligands diverged between tissues (DLL1 and
JAG1 in YS and DLK1, JAG1/2, NOV, and DLL4
in BM) (Fig. 4D). The YS endoderm was predicted
to support HSC pool expansion (36)
through WNT5A signaling to FZD3. WNT5A
was also expressed by a wide range of BM stromal
cell types, but BM HSPCs were predicted
to respond via FZD6 rather than FZD3. All YS
stromal fractions contributed to the extracellular
matrix, which provides a substrate for
adhesion but also modifies HSPC function,
with FN (from all fractions) potentially expanding
the HSPC pool and VTN (from the
endoderm) contributing to long-term HSC-like
quiescence (Fig. 4, D and F) (37, 38). Although
BM HSPCs were also predicted to adhere to
extracellular matrix proteins, the integrins and
matrix constituents differed. The YS endoderm
was predicted to form interactions with HSPCs
via EPO, which may influence the fate of differentiating
progenitors (39), and THPO, which
supports HSC quiescence and adhesion in BM
(40). No BM stromal source of EPO or THPO
was detectable in our data, however (10, 34).
Thus, these anatomically different hematopoietic
tissues use similar pathways to support
HSPCs, albeit with tissue-specific components.
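
Predicted receptor-ligand interactions of this kind are typically assessed by asking whether a ligand in one cell type and its receptor in another are jointly expressed more highly than expected if cell-type labels were random. The sketch below is a simplified, self-contained Python illustration of such a label-permutation test. It does not call CellPhoneDB and should not be read as its API or its exact statistic; the function names, toy expression values, and cell-type labels are assumptions for illustration only.

```python
import random
from statistics import mean


def interaction_score(ligand, receptor, labels, sender, receiver):
    """Average of mean ligand expression in sender cells and
    mean receptor expression in receiver cells."""
    ligand_mean = mean([v for v, lab in zip(ligand, labels) if lab == sender])
    receptor_mean = mean([v for v, lab in zip(receptor, labels) if lab == receiver])
    return (ligand_mean + receptor_mean) / 2.0


def permutation_p_value(ligand, receptor, labels, sender, receiver,
                        n_permutations=1000, seed=0):
    """Empirical P value: share of label shuffles scoring >= the observed score."""
    rng = random.Random(seed)
    observed = interaction_score(ligand, receptor, labels, sender, receiver)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # shuffling keeps the number of cells per type fixed
        if interaction_score(ligand, receptor, shuffled, sender, receiver) >= observed:
            hits += 1
    return observed, hits / n_permutations


# Toy per-cell expression (invented): a ligand high in "Endoderm" cells and
# its receptor high in "HSPC" cells, e.g. a WNT5A-FZD3-like pair.
labels = ["Endoderm"] * 6 + ["HSPC"] * 6
ligand_expr = [3, 4, 3, 5, 4, 3, 0, 0, 1, 0, 0, 0]
receptor_expr = [0, 0, 0, 1, 0, 0, 2, 3, 2, 3, 2, 2]

observed_score, p_value = permutation_p_value(
    ligand_expr, receptor_expr, labels, sender="Endoderm", receiver="HSPC"
)
```

An interaction would be called significant when the empirical P value falls below the chosen threshold (P < 0.05 in the figure legends of this study).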

YS HSPC receptor to stromal ligand interactions
diminished between CS17 and CS23 (4
to 8 PCW), including the loss of cytokine and
growth factor support and the loss of TGFB1,
WNT, and NOTCH2 signals (Fig. 4E, fig. S7E,
and data S29). In many interactions, there was
a reduction in HSPC receptor expression as
well as stromal ligand expression (Fig. 4; fig.
S7, D and E; and data S29); yet, ligands were
still expressed in age-matched liver and AGM
stromal cells (fig. S7F and data S29). Adhesive
interactions in the YS were also predicted to
be significantly modulated (fig. S7, F and G,
and data S29). Although age-matched liver
provided opportunities for adhesion with stromal
cells, the AGM did not (fig. S7F and data
S29). YS interactions gained between CS17 and
CS23 included endoderm-derived IL13 signaling
to the TMEM219-encoded receptor implicated
in the induction of apoptosis (Fig. 4D).
Although limited conclusions can be made from
studying cells that passed quality control for cell
viability, we did observe up-regulation in pro-apoptotic
gene scores in late-stage YS HSPCs,
both early and definitive (fig. S7H).
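
A gene score of the kind mentioned above can be computed, in its simplest form, as the mean expression of a gene set in a cell minus the mean expression of a set of control genes. The Python sketch below shows that calculation. The gene lists (BAX, BAK1, and CASP3 as pro-apoptotic genes; ACTB and GAPDH as controls), the expression values, and the scoring formula are assumptions chosen for illustration; the gene set and scoring method used for fig. S7H are not specified in this section.

```python
from statistics import mean


def gene_set_score(cell, gene_set, control_genes):
    """Mean expression of gene_set minus mean expression of control_genes.

    cell maps gene name to normalized expression; missing genes count as 0.
    """
    target = mean([cell.get(gene, 0.0) for gene in gene_set])
    control = mean([cell.get(gene, 0.0) for gene in control_genes])
    return target - control


# Illustrative gene sets (assumed; not the study's list).
PRO_APOPTOTIC = ["BAX", "BAK1", "CASP3"]
CONTROLS = ["ACTB", "GAPDH"]

# Toy cells with invented expression values.
early_stage_cells = [
    {"BAX": 0.5, "BAK1": 0.4, "CASP3": 0.3, "ACTB": 3.0, "GAPDH": 3.2},
    {"BAX": 0.6, "BAK1": 0.2, "CASP3": 0.4, "ACTB": 3.1, "GAPDH": 2.9},
]
late_stage_cells = [
    {"BAX": 2.5, "BAK1": 2.0, "CASP3": 1.8, "ACTB": 3.0, "GAPDH": 3.1},
    {"BAX": 2.2, "BAK1": 2.4, "CASP3": 2.1, "ACTB": 2.9, "GAPDH": 3.0},
]

early_score = mean([gene_set_score(c, PRO_APOPTOTIC, CONTROLS) for c in early_stage_cells])
late_score = mean([gene_set_score(c, PRO_APOPTOTIC, CONTROLS) for c in late_stage_cells])
```

Subtracting a control-gene baseline guards against a score rising simply because a cell has more total signal, which is relevant when comparing cells across gestational stages.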

Despite a marked change in the stromal en- 
vironment of the later stage YS, the proportion 
of HSPC to cycling HSPC remained stable (fig. 
S5F). Differential lineage priming analysis re- 
vealed that very few HSPCs remained in CS22 
to CS23 (8 PCW) YS, and most cells were termi- 
nally differentiated (fig. S6C). Thus, it is likely 
that an early burst of early HSPC production 
arises from transient YS HE, a later influx of 
definitive HSPCs derives from AGM, and a loss 
of stromal support between 6 and 8 PCW re- 
sults in apoptosis and depletion of remaining 
HSPCs by terminal differentiation. 


An accelerated route to macrophage production 
in YS and iPSC culture 


Although YS hematopoietic progenitors are 
restricted to a short time window in early ges- 
tation, mouse models suggest that they con- 
(NK and B lineage), erythroid, and MK terminal states. Arrows indicate proposed
lineage priming based on KDE. (H) Radial plots showing lineage transition probabilities 
between iPSC-derived HSPCs (left) and definitive iPSC-derived HSPCs (right). 
Interpretation as in (G), with addition of embryonic erythroid terminal state. 


tribute to long-lived macrophage populations 
in some tissues (41). By scRNA-seq, transcrip- 
tionally similar macrophage populations can 
be identified in the YS and fetal brain before 
the emergence of definitive HSPCs (9). In our 
previous work, k = 6682 YS macrophages resolved
into two subgroups (10). By contrast,
our integrated dataset of k = 45,118 YS macrophages
in the current study revealed a greater
heterogeneity, including premacrophages, C1QA/B/C-
and MRC1-expressing macrophages, and
a rare TREM2+ macrophage subset (fig. S8A).
Promonocytes expressing HMGB2, LYZ, and
SPI and monocytes expressing S100A8, S100A9,
and MNDA were also detected (fig. S8A). Monocytes
were observed only after liver development
and AGM-derived hematopoiesis at CS14
(~5 PCW), but premacrophages and macrophages
formed as early as CS10 (~4 PCW) (Fig. 5A
and fig. S8B). Although the potential of early
YS HSPCs to differentiate into monocytes has
been demonstrated in vitro (9), there were too
few promonocytes and monocyte progenitors
(MOPs) in our data before CS14 to reliably confirm
this potential. We identified two populations
of YS monocytes, which diverged in
expression of adhesion molecules. YS monocyte
2 expressed adhesion molecules ICAM3,
SELL, and PLAC8 (Fig. 5B), which were also
expressed on fetal liver but not YS HSPCs (fig.
S5C). YS monocyte 2 had a high probability of
class prediction against fetal liver monocytes
(fig. S8C). Thus, monocyte 2 is likely a recirculating
fetal liver monocyte, although sequential
waves of monocytopoiesis occurring within the
YS cannot be excluded. YS CITE-seq data were
used to identify discriminatory markers (CD15
and CD43 for monocyte 1; CD9 and CD35 for
monocyte 2) and provide protein-level validation
for differential expression of SELL (CD62L) and
CD14 (fig. S8D and data S20).

The YS premacrophage differentially expressed
high levels of PTGS2, MSL1, and SPIAI as well
as expressing progenitor genes (SPINK2, CD34,
and SMIM24), macrophage genes (C1QA and
MRC1), and CD52, which is typically associated
with monocytes (fig. S8A). This YS premacrophage
rapidly declined by 5 PCW (Fig. 5A) and
had no equivalent in the embryonic liver (fig.
S8C). k-nearest neighbor (KNN) graph-based
FDG and partition-based graph abstraction
(PAGA) suggested a direct monocyte-independent
trajectory to YS macrophages before CS14 (Fig.
5C and data S5). In this pre-AGM trajectory, a
transition from HSPC to premacrophages then
macrophages (nodes 1, 5, and 6 in Fig. 5C, upper
panel) fit with our predictions that pre-AGM
HSPCs exhibit myeloid bias (Fig. 3G). After CS14,

Fig. 4. The life span of YS HSPCs. (A) Bar chart showing the relative proportions 
of YS EC subsets by age (PCW). sin. EC, sinusoidal endothelial cells. (B) FDG overlaid 
with PAGA showing trajectory of HE transition to HSPC in YS scRNA-seq data 
(n = 3; CS10, CS11, and CS14; k = 2262) (top) and iPSC-derived HSPC scRNA-seq data 
(n = 7; k = 437) (20) (bottom), with feature plots of key genes (IL33, ALDH1A1) 
involved in endothelial to hemogenic transition (data S5). (C) Dot plot showing the 
mean expression (color scale) and proportion of cells expressing EC-associated genes 
(dot size) in HSPCs across gestational age (PCW). HSPCs are derived from YS (including 
gastrulation), AGM (12), matched EL and FL (10), fetal BM (34), iPSC-derived HSPC 
(20), and definitive iPSC-derived HSPC (12) scRNA-seq datasets. (D) Dot plot of the mean 
expression (color scale) and the fraction of cells expressing each gene (dot size) 
of curated genes predicted by CellphoneDB to form statistically significant (P < 0.05) 
protein-protein interactions between HSPCs (top) and stromal cells (bottom) across 
all gestational time points. Brackets indicate which protein counterparts form 
complexes (data S29). Data are log-normalized, variance-scaled, and min-max-standardized 
with a distribution of 0 to 1. (E) Heatmap showing curated and 
statistically significant (P < 0.05) CellphoneDB-predicted interactions between YS 
HSPCs and stromal cells that change across gestation. Color scale indicates relative 
mean expression z-scores. (F) Schematic of selected and statistically significant 
(P < 0.05) CellphoneDB-predicted interactions between YS HSPCs and endoderm, 
fibroblasts (Fib), smooth muscle cells (SMCs), or EC derived from scRNA-seq data. 
Interactions are grouped by predicted receptor to ECM interactions, ligand-receptor 
interactions, and surface-bound ligand-receptor interactions. Receptors and ligands in 
italics significantly decrease at CS17 to CS23 (6 to 8 PCW) (data S28 and S29). 

[Fig. 4 graphic not reproduced. Recoverable panel labels: IGF2, FN1, DLK1, EPO, THPO, 
VTN, KITLG, DLL1, JAG1, FBN1, WNT5A; MPL, EPOR, IGF1R, KIT, FZD3; NOTCH1/2/4; 
stromal cell, EC, fibroblast/SM, endoderm; differentiation, self-renewal.] 

Goh et al., Science 381, eadd7564 (2023) 18 August 2023 

Fig. 5. Accelerated macrophage production in YS and iPSC culture. 
(A) (Left) Line graph of monocyte and macrophage proportions in YS scRNA-seq 
across time. Dashed line indicates pre- and post-AGM stages. (Middle) Milo 
beeswarm plot showing differential abundance of YS scRNA-seq myeloid 
neighborhoods across time. Color shows degree of enrichment (blue, early; red, 
later) (data S4 and S24). (Right) Bar chart of YS scRNA-seq myeloid cell state 
proportions across time. Mono-mac int., monocyte macrophage intermediate. 
(B) Dot plot showing the mean expression (color) and proportion of cells 
expressing monocyte marker genes (dot size) in EL monocytes and YS myeloid 
cell states. Genes include YS versus EL monocyte DEGs and established 
monocyte markers (data S17). (C) (Left) FDG of macrophage trajectory in YS 
scRNA-seq, colored by cell state, overlaid with PAGA showing monocyte-independent 
<CS14 (pre-AGM; n = 2; k = 3561; top) and monocyte-dependent 
trajectories >CS14 (post-AGM; n = 6; k = 35,962; bottom) (data S5). (Right) 
FDG overlaid with scVelocity directionality, colored by cell cycle gene enrichment 
(GO:000704 module). (D) Heatmap of regulons associated with trajectories 
in (C). TFs discussed in the text are highlighted (turquoise, premacrophage; 
purple, monocyte-dependent). (E) Dot plot showing the mean expression 
(color) and proportion of cells expressing macrophage and microglia marker 
genes (dot size) in myeloid cell states in YS, AGM (12), skin (48), gonad (49), 
and brain (55) fetal scRNA-seq datasets (data S13 and S31). (F) Heatmap 
of significant (P < 0.05) CellphoneDB-predicted interactions between YS 
scRNA-seq TREM2+ macrophages and ECs (data S28). Color represents z-scored 
expression of gene pairs, and brackets indicate top curated interactions for 
cell-state pairs. (G) FDG of macrophage trajectory in iPSC scRNA-seq (20) 


there was a clear differentiation trajectory from 
cycling HSPC to monocytes and monocyte- 
macrophages (nodes 1 to 7 in Fig. 5C, lower 
panel). After CS14, 15.33% of this macrophage 
pool was proliferating, and CellRank RNA 
state transition analysis was in keeping with 
active self-renewal (fig. S8E and data S5). 
Using PySCENIC, YS premacrophages were 
predicted to use a group of transcription factors 
(TFs), including FLI1 and MEF2C, that have 
been reported in the differentiation of multi- 
ple lineages (42, 43). By contrast, the monocyte- 
dependent route (CMPs, MOPs, promonocytes, 
and monocytes) relied on recognized myeloid 
TFs such as SPI1, CEBPA, and IRF8 (Fig. 5D 
and data S30). 

TREM2+ macrophages expressed microglia-associated 
transcripts CX3CR1, OLFML3, and 
TREM2 and were observed in the YS only after 
CS14 (Fig. 5, A, C, and E; fig. S8A; and data S13). 
By PAGA and CellRank state transition analysis, 
TREM2+ macrophages were closely aligned to 
the self-renewing macrophage population (Fig. 
5C and fig. S8E). YS TREM2+ macrophages were 
located adjacent to the mesothelium, in a region 
enriched by ECs (fig. S8F). CellPhoneDB 
predicted interactions between TREM2+ macrophages 
and VWF+ ECs via CXCL8 and NRP1, 
both of which are involved in angiogenic pathways 
(44, 45) (Fig. 5F and data S28). TREM2+ 
macrophages also expressed the purinergic 
receptor P2RY12, which supports trafficking 
toward adenosine 5’-triphosphate (ATP)- or 
adenosine 5’-diphosphate (ADP)-expressing 
ECs, as reported in the mouse central nervous 
system (46, 47) (Fig. 5E and data S31). To establish 
whether TREM2+ macrophages are present 
in other fetal tissues, we assembled an integrated 
12-organ developmental atlas (fig. S8G). 
We resolved six macrophage fractions based 
on harmonized cross-tissue definitions from 
our recent prenatal immune analysis (by label 
transfer): premacrophages and TREM2+ macrophages 
(as in our cluster-driven annotations) 
as well as LYVE1hi, Kupffer-like, iron-recycling, 
and proliferating macrophages (48) (fig. S8, C 
and G to J, and data S5 to S7 and S17). TREM2 is 
implicated in lipid sensing by anti-inflammatory 
tissue macrophages in the adult human and 
mouse (23-25), but we observed the highest expression 
of TREM2 in macrophages bearing a 
microglia-like signature in developing tissues, 
including YS, skin [as previously reported (48)], 
gonads [as previously reported (49)], brain, and 
AGM but not BM, liver, kidney, thymus, mesenteric 
lymph nodes (MLNs), or gut (fig. S8, I 
and J, and data S31). 
as in (D). 


Next, we investigated whether transcriptional 
features of pre-AGM macrophages could be 
used to evaluate YS macrophage contribution 
to developing tissues. In our 12-organ macro- 
phage dataset, pre-AGM macrophages were 
compared against post-AGM macrophages in 
an integrated variational-autoencoder (VAE) 
latent space using a Bayesian differential ex- 
pression approach. The most predictive pre- 
AGM macrophage features comprised nine 
genes, including five genes in common with a 
TLF+ signature identified from cross-tissue analysis 
of mouse macrophages (LYVE1, TIMD4, 
FOLR2, MRC1, and NINJ1) (50) (fig. S9A and 
data S17). By KDE, macrophages significantly 
enriched in our pre-AGM module colocalized 
with LYVE1hi macrophages from gonads, liver, 
skin, and AGM and with all macrophage fractions 
from the YS (fig. S9, B to D; fig. S8H; and 
data S7 and S32). The proportion of pre-AGM 
module-enriched macrophages trended down- 
ward over time, even in the brain (fig. S9E). By 
transcriptome alone, it was not possible to 
separate dilution by influx of non-YS macro- 
phages from transcriptional adaptation to 
the tissue environment. With this caveat, we 
assembled a 20-organ, cross-tissue integrated 
landscape of adult tissue macrophages using 
publicly available single-cell data from the Hu- 
man Cell Atlas and Tabula Sapiens (fig. S9, F 
to H, and data S6, S7, S14, and S17). Fat, vasculature, 
muscle, brain, and bladder had the highest 
proportion of macrophages enriched in the 
pre-AGM signature (fig. S9H and data S7). 

We integrated our YS gene expression data 
with scRNA-seq data from iPSC-derived macrophage 
differentiation (n = 19; k = 50,512) (20) 
after refining the annotations of iPSC-derived 
cell states (fig. S10, A to C, and data S5 and 
S13). Nonadherent, CD14-expressing cells appearing 
after week 2 of differentiation expressed 
C1QA, C1QB, and APOC1 in keeping with a macrophage 
identity, whereas CD14-, CD52-, FCN1-, and 
S100A8/9-expressing monocytes only emerged 
after week 3 (fig. S10, D and E). Before monocyte 
emergence, a monocyte-independent macrophage 
differentiation trajectory was observed, 
consistent with previous observations (20) (Fig. 
5G and fig. S10C). TF regulatory profiles of 
iPSC-derived macrophage differentiation were 
consistent with both the premacrophage and 
monocyte-dependent TF profiles inferred from 
our YS data, including usage of MEF2C, SPI1, 
CEBPA, and IRF8 in iPSC-derived premacrophages 
(Fig. 5H and data S30). However, neither 
iPSC culture system could recapitulate the heterogeneity 
of macrophages seen in native tissues 
colored by cell state, overlaid with PAGA showing monocyte-independent 
<D21 (n = 5; k = 779; left) and monocyte-dependent >D21 (n = 7; k = 8553; right) 
transitions (data S7 and S5). (H) Heatmap of regulons associated with iPSC 
macrophage trajectories shown in (G). TFs discussed in the text are highlighted, 


(fig. S10E), which suggests that interactions 
with stromal cells, such as ECs, may be required 
to acquire specific molecular profiles. 


Discussion 


Using single-cell multiomic and imaging tech- 
nologies, we delineate the dynamic composi- 
tion and functions of human YS in vivo from 
3 PCW, when the three embryonic germ layers 
form, to 8 PCW, when most organ structures 
are already established (27). Although the scar- 
city and small sample size necessitated a pri- 
marily computational approach, we deliver a 
comprehensive resource. LR and VAE models 
provided by our data will facilitate future use 
of our YS atlas to map scRNA-seq datasets 
(51, 52), empowering future mechanistic per- 
turbation and lineage-tracing experiments in 
iPSCs and model systems. 

We detail how the YS endoderm shares meta- 
bolic, biosynthetic, and erythropoiesis-stimulating 
functions with the liver. In part, this shared 
functionality may relate to their common role 
in creating a hematopoietic niche (53). We iden- 
tify differences in the handover from YS to liver 
hematopoiesis between species. In mice, ery- 
throid progenitors in the YS mature before the 
onset of circulation, but erythromyeloid pro- 
genitors can exit the YS and mature in the fetal 
liver, giving rise to long-lived populations, such 
as fetal liver monocyte-derived macrophages. 
We show that in human YS, active differentia- 
tion of erythroid and macrophage cells occurs 
for several weeks before liver handover, and, at 
least in terms of erythropoiesis, there is a rapid 
transition from YS erythroid production to 
embryonic liver erythroid production shortly 
after AGM-derived HSPCs emerge. In a land- 
mark study on human Hb switching, directly 
labeled 6-PCW liver and YS erythrocytes contained 
embryonic Hb subunits (ε and ζ), but 
colonies derived from liver and YS progenitors 
at this time produced fetal Hb subunits (α 
and γ) (8). This is in keeping with YS-derived 
erythrocytes recirculating throughout the 
embryo and membranes while a post-AGM 
progenitor is preparing for liver erythropoiesis. 
Direct evidence that human liver erythropoiesis 
is supplied predominantly from AGM-derived 
HSPCs rather than a YS-derived EMP-like pro- 
genitor is still lacking. Future studies are needed 
to examine the handover of macrophage pro- 
duction from early to definitive sources in hu- 
mans, which may question the primacy of mouse 
models of early myelopoiesis. A more expansive 
species reference, including rabbits with their 
greater early gestational similarity to humans, 
will facilitate selection of appropriate models 
for genetic manipulation and functional vali- 
dation (54). 

The developmental window investigated in 
this work encompasses hematopoiesis from 
HSPCs arising both within the YS and within 
the embryo proper. We reconstructed YS HSPC 
emergence from a temporally restricted HE, 
featuring similar transition states and molec- 
ular regulation to AGM HSPCs. By gastrulation 
(CS7; 2 to 3 PCW), YS HSPCs already differ- 
entiate into erythroid, MK, and myeloid lin- 
eages. Building on a recent compilation of gene 
scorecards that characterize early and definitive 
HSPCs (12), we were able to parse the two 
fractions and document the transition to de- 
finitive HSPC dominance after CS14 (~5 PCW). 
This separation also allowed us to identify an 
early HSPC bias toward myeloid, erythroid, 
and MK lineages and a definitive HSPC bias 
toward MK and lymphoid lineages. Both early 
and definitive YS HSPCs became more quies- 
cent and up-regulated apoptosis-related genes 
between CS17 and CS23 (~6 to 8 PCW). Stromal 
cell ligands predicted to support HSPCs were 
markedly disrupted during this time, which 
suggests that the barriers to YS HSPC survival 
may be extrinsic. 

Early HSPCs use an accelerated route to mac- 
rophage production independent of monocytes. 
Both accelerated and monocyte-dependent 
macrophages were recapitulated during in vitro 
differentiation of iPSCs, but diverse macrophage 
subtypes, such as TREM2+ macrophages, 
were not. TREM2+ macrophages, which are 
transcriptionally aligned with brain microglia, 
fetal skin, testes, and AGM TREM2+ macrophages, 
were predicted to interact with ECs, 
potentially supporting angiogenesis, as has 
been described in the mouse brain (55). 

There is a growing appreciation of the po- 
tentially life-long consequences of early devel- 
opmental processes. Our study illuminates a 
previously obscure phase of human develop- 
ment, where vital organismal functions are 
delivered by a transient extraembryonic organ 
using noncanonical cellular differentiation path- 
ways that can be leveraged for tissue engineer- 
ing and cellular therapy. 


Materials and methods 
Ethics and sample acquisition 


Tissues were obtained from the MRC-Wellcome 
Trust-funded Human Developmental Biology 
Resource (HDBR; https://www.hdbr.org) with 
appropriate written consent and approval 
from the Newcastle and North Tyneside NHS 
Health Authority Joint Ethics Committee (18/ 
NE/0290). HDBR is regulated by the UK Hu- 
man Tissue Authority (HTA; www.hta.gov.uk) 
and operates in accordance with the relevant 
HTA Codes of Practice. Tissues used for light- 
sheet fluorescence microscopy were obtained 
through INSERM’s HuDeCA Biobank and made 
available in accordance with the French bylaw 
(Good practice concerning the conservation, 
transformation and transportation of human 
tissue to be used therapeutically, published 
on 29 December 1998). Permission to use hu- 
man tissues was obtained from the French 
agency for biomedical research (Agence de la 
Biomédecine, Saint-Denis La Plaine, France). 

Embryos were staged using the Carnegie stag- 
ing method (56). A piece of skin or chorionic 
villi tissue was collected from each sample to 
perform quantitative PCR karyotyping of sex 
chromosomes and autosomal chromosomes 
13, 15, 16, 18, 21, and 22 for the most commonly 
seen chromosomal abnormalities. No abnor- 
malities were detected. 


Processing samples for imaging and 
single-cell sequencing 


Tissues were transported in phosphate-buffered 
saline (PBS) on ice, were dissected within 24 hours, 
and were processed immediately (<1 hour after 
dissection). For formalin-fixation and paraffin- 
embedding, samples were immediately placed 
in 10% (w/v) formalin. Processing and embed- 
ding were performed by NovoPath, Newcastle 
upon Tyne NHS Trust. For RNAscope, samples 
were snap-frozen in an isopentane bath in liq- 
uid nitrogen before embedding in optimal cut- 
ting temperature (OCT) compound. Single-cell 
suspensions were generated by dicing tissue into 
segments <1 mm³, followed by enzymatic digestion 
for 30 min at 37°C with intermittent shaking. 
Digestion media was 1.6 mg/ml collagenase 
type IV (Worthington) in RPMI (Sigma-Aldrich) 
supplemented with 10% (v/v) heat-inactivated 
fetal bovine serum (FBS; Gibco), 100 U/ml of 
penicillin (Sigma-Aldrich), 0.1 mg/ml of strep- 
tomycin (Sigma-Aldrich), and 2 mM L-glutamine 
(Sigma-Aldrich). Digested tissue was passed 
through a 100-µm filter, and cells were collected 
by centrifugation (500g for 5 min at 4°C). Cells 
were treated with 1X red blood cell (RBC) lysis 
buffer (eBioscience) for 5 min at room temper- 
ature and washed once with Flow Buffer [PBS 
containing 5% (v/v) FBS and 2 mM EDTA] be- 
fore counting. Processing for scRNA-seq was 
continued promptly on fresh cells; for other 
uses (including CITE-seq) cells were collected 
by centrifugation (500g for 5 min at 4°C) and 
resuspended in 10% (v/v) dimethyl sulfoxide 
(DMSO) in FBS for freezing. For light-sheet flu- 
orescence microscopy, tissues were fixed in 4% 
paraformaldehyde (PFA) and dissected. Ges- 
tational age was then estimated as previously 
described (57). 


Processing of single-cell suspensions for scRNA-seq 


Immediately after isolation and counting, cells 
were collected by centrifugation (500g for 5 min 
at 4°C) and resuspended in a residual buffer. 
Three microliters of CD45 BUV395 (clone: HI30, 
BD Biosciences) was added to the resuspended 
cells and incubated on ice in the dark for 30 min, 
washed with Flow Buffer and resuspended at 
~1 × 10^7 cells/ml. Immediately before sorting, 
cells were passed through a 35-µm filter (Falcon) 
and 4’,6-diamidino-2-phenylindole (DAPI) (Sigma-Aldrich) 
was added at a final concentration 
of 3 µM. Flow sorting was performed on a BD 
FACSAria Fusion instrument using DIVA v.8, 
and data were analyzed using FlowJo (v.10.4.1, 
BD Biosciences). Cells were gated to exclude 
dead cells and doublets, and then isolated for 
scRNA-seq analysis (droplet-based 10x Genomics, 
or plate-based Smart-seq2) using a 100-µm nozzle. 
For droplet-based scRNA-seq, CD45+ and 
CD45− cells were sorted into separate chilled 
FACS tubes coated with FBS and prefilled with 
500 µl of sterile PBS. For plate-based scRNA-seq, 
CD45 AF*SSC™* single cells were index-sorted 
into 96-well LoBind plates (Eppendorf) containing 
10 µl of lysis buffer [TCL (Qiagen) + 1% 
(v/v) β-mercaptoethanol] per well. 


Library preparation and sequencing 
of scRNA-seq and CITE-seq samples 


For the droplet-based scRNA-seq experiments, 
cell suspensions isolated by FACS were counted 
and loaded onto the 10x Genomics Chromium 
Controller to achieve a maximum yield of 10,000 
cells per reaction. 5’ V1 kits were used and se- 
quencing libraries were generated according 
to the manufacturer’s protocols. Libraries were 
sequenced using either an Illumina HiSeq 4000 
or NovaSeq 6000 to generate at least 50,000 raw 
reads per cell. 

For the plate-based scRNA-seq experiments, 
the frozen cell lysates were thawed on ice for 
1 min. Purified cDNA was generated and amplified 
using a modified Smart-seq2 protocol 
described in Villani et al. (58). Sequencing li- 
braries were then generated using Illumina 
Nextera XT kits with v2 index sets A, B, C, and 
D. 384 cells were pooled and were sequenced 
using a HiSeq 4000 to generate at least 1 × 10^6 
raw reads per cell. 

For the CITE-seq experiments, frozen cells 
were thawed, counted, and pooled. Fc blocking 
reagent (Biolegend) was added to the cell 
pools and left to incubate at room temperature 
for 10 min. Five hundred nanoliters of CD34 
APC/Cy-7 (clone: 581, Biolegend) was then added 
to the Fc-blocked cells and left to incubate in the 
dark and on ice for 10 min. During this incuba- 
tion, the CITE-seq antibody cocktail (Biolegend) 
(see data S33) was centrifuged at 14,000g for 
1 min. Flow buffer was then added to reconsti- 
tute before incubating for 5 min at room tem- 
perature. The resuspended antibody cocktail 
was then centrifuged at 14,000g for a further 
10 min before adding to the cells. The cells and 
CITE-seq antibody cocktail were then left to 
incubate for 30 min in the dark and on ice. 
After this time, the cells were washed twice 
with Flow buffer and resuspended in a final 
concentration of 50 µg/ml of 7-AAD (Thermo 
Fisher Scientific) in Flow buffer. 
Live, single cells or live, single CD34+ cells 
and live, single CD34− cells (for the CITE-seq 
experiments) were then isolated by FACS into 
500 µl of PBS in FACS tubes coated with FBS. 
Cells were then counted and submitted to the 
CRUK CI Genomics Core Facility for subse- 
quent processing using 10x Genomics protocols 
and sequencing. Single-cell gene expression 
and cell surface protein libraries were gen- 
erated using single-cell 3’ v3 kits according to 
the manufacturer’s protocol. Libraries were 
sequenced using a NovaSeq 6000 to achieve 
a minimum of 20,000 reads per cell for gene 
expression and 5000 reads per cell for cell- 
surface protein. 


Alignment, quality control, filtering, and 
preprocessing of scRNA-seq and CITE-seq data 


scRNA-seq expression data (including droplet- 
based and plate-based) were mapped with 
CellRanger (version 3.0.2) to a human reference 
genome (see data S1) and low-quality cells ex- 
pressing <2000 reads, <200 genes, and >20% 
mitochondrial reads were filtered out of the 
data. Genes expressed in fewer than 
three cells were removed. 
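
The cell- and gene-level thresholds above can be illustrated with a small sketch. It is written in plain Python rather than with the Scanpy pipeline used in the study; the function name `qc_filter`, the `MT-` prefix used to recognise mitochondrial genes, the reading that a cell is discarded when it fails any one criterion, and the choice to count gene detection over retained cells only are all assumptions made for illustration.

```python
def qc_filter(counts, gene_names, min_reads=2000, min_genes=200,
              max_mito_frac=0.20, min_cells_per_gene=3):
    """Return indices of cells and genes that pass the QC thresholds.

    counts: list of per-cell lists of integer counts (one column per gene).
    """
    # Assumption: mitochondrial genes are identified by an "MT-" name prefix.
    is_mito = [name.startswith("MT-") for name in gene_names]

    kept_cells = []
    for idx, row in enumerate(counts):
        total = sum(row)
        n_genes = sum(1 for c in row if c > 0)
        mito_reads = sum(c for c, mt in zip(row, is_mito) if mt)
        mito_frac = mito_reads / total if total else 1.0
        # Keep the cell only if it passes all three criteria.
        if total >= min_reads and n_genes >= min_genes and mito_frac <= max_mito_frac:
            kept_cells.append(idx)

    # Keep genes detected in at least `min_cells_per_gene` retained cells.
    kept_genes = [
        j for j in range(len(gene_names))
        if sum(1 for i in kept_cells if counts[i][j] > 0) >= min_cells_per_gene
    ]
    return kept_cells, kept_genes


# Toy example with scaled-down thresholds so the behaviour is easy to follow.
toy_counts = [[5, 0, 1], [0, 0, 10], [4, 3, 0], [6, 2, 1], [3, 1, 0]]
toy_genes = ["A", "B", "MT-C"]
toy_cells, toy_kept_genes = qc_filter(
    toy_counts, toy_genes, min_reads=5, min_genes=2,
    max_mito_frac=0.2, min_cells_per_gene=3)
```

The default arguments mirror the thresholds stated in the text; the toy call overrides them only so that a five-cell matrix can exercise every branch.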

For droplet-based scRNA-seq data, the following 
additional QC steps were performed. Scrublet 
(59) v0.2.3 was applied to each sequencing 
lane for doublet detection, and clusters whose 
median doublet detection score exceeded 
[median + (1.48 × MAD)] of the per-cluster median 
scores (where MAD is the median absolute 
deviation) were removed (data S3). Ambient RNA 
was removed with Cellbender (v0.2.0) with false 
discovery rate (FDR) = 0.01 and epochs = 150 (60). 
To determine likelihood of maternal contamination, 
data were pooled by donor and submitted to Souporcell 
(v2.4.0) with genotype cluster models of k = 1 and k = 2 
to represent likelihood of no maternal 
contamination and possible maternal contamination, 
respectively. The optimal model was 
identified via BIC (Bayesian information criterion), 
where we observed a smaller BIC index 
at k = 2 in one donor (F37, female, 5 PCW). 
Cells from the F37 alternate genotype were 
identified as potential maternal contaminants, 
composed mainly of monocytes (n = 149) and 
monocyte-macrophage intermediates (mono-mac 
int.) (n = 25), and excluded from downstream 
analysis. 
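
A minimal sketch of the cluster-level doublet threshold follows. It assumes that the median and MAD are taken over the per-cluster median Scrublet scores, which is one reading of the description above; the function name `flag_doublet_clusters` and the input layout are hypothetical, and the doublet scores themselves would come from Scrublet rather than from this code.

```python
from statistics import median


def flag_doublet_clusters(cluster_scores, n_mads=1.48):
    """Flag clusters whose median doublet score exceeds median + n_mads * MAD.

    cluster_scores: dict mapping a cluster label to the list of per-cell
    doublet scores in that cluster.
    """
    cluster_medians = {c: median(s) for c, s in cluster_scores.items()}
    centre = median(cluster_medians.values())
    # Median absolute deviation of the per-cluster medians.
    mad = median(abs(m - centre) for m in cluster_medians.values())
    threshold = centre + n_mads * mad
    flagged = sorted(c for c, m in cluster_medians.items() if m > threshold)
    return flagged, threshold


example_scores = {
    "a": [0.1, 0.2, 0.3],
    "b": [0.1, 0.1, 0.2],
    "c": [0.2, 0.3, 0.4],
    "d": [0.8, 0.9, 0.7],
}
flagged_clusters, doublet_threshold = flag_doublet_clusters(example_scores)
```

In this toy input only cluster "d" sits well above the other cluster medians, so it is the only one flagged for removal.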

For CITE-seq data, FASTQ alignment was 
performed for multiplexed RNA lanes with 
CellRanger (v4.0.0) and GRCh38-2020-A ref- 
erence genome, and for multiplexed protein 
lanes with CITE-seq-Count (v1.4.3). Lanes with 
cells pooled from multiple donors were decon- 
voluted using Souporcell singularity image at 
https://github.com/wheaton5/souporcell. Low- 
quality cells expressing <200 genes and >20% 
mitochondrial reads were removed, and dou- 
blets were removed by applying Scrublet v0.2.3 
to each sequencing lane and then removing 
clusters with a median cluster doublet detection 
score >[median + (1.48 × MAD)]. CITE-seq 
protein data underwent QC and preprocessing 
as previously described (34) (i.e., cells 
were first filtered to intersect barcodes with 
counterpart CITE-seq RNA data, then unmapped 
antibodies were filtered out, and then 
low-quality protein cells with <30 proteins 
and expressing >5000 reads were filtered out) 
(data S4, S19, and S22). 

scRNA-seq count matrix transformation, normalization, 
and preprocessing were performed 
using Scanpy (61) (v1.9.0) in Python (v3.8.6). We 
normalized raw gene counts using the 
sc.pp.normalize_total function (target_sum = 10e4) 
and performed ln(x + 1) transformation. Reported 
expression values were normalized, 
log-transformed, and scaled to unit variance and 
zero mean using the sc.pp.scale function independently 
for each analysis. 
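
The normalization step can be sketched without Scanpy as follows. This is an illustration of the arithmetic only (per-cell scaling to a fixed total followed by a natural-log transform of x + 1), not the study's code; the helper name `normalize_log1p` is hypothetical. Note that the literal value 10e4 evaluates to 100,000 in Python.

```python
import math


def normalize_log1p(counts, target_sum=10e4):
    """Scale each cell to `target_sum` total counts, then apply ln(x + 1)."""
    normalized = []
    for row in counts:
        total = sum(row)
        # Cells with no counts are left as zeros to avoid division by zero.
        scale = target_sum / total if total else 0.0
        normalized.append([math.log1p(c * scale) for c in row])
    return normalized


# Small target_sum so the numbers are easy to check by hand:
# the first cell becomes [ln(26), ln(76)], the empty cell stays [0.0, 0.0].
example_norm = normalize_log1p([[1, 3], [0, 0]], target_sum=100)
```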

For CITE-seq data count matrix transformation, 
we first performed denoised and scaled 
by background (DSB) normalization and then applied 
a Gaussian mixture model (GMM) for 
background nonspecific binding signal regression 
per sample as previously described (62). 
For the first step, a modified DSB-normalization 
approach used in our previous study (34) 
was constructed. For each CITE-seq lane, 
low-quality/empty droplets were identified as droplets 
under the largest UMI peak that had a 
value more than 1.96 standard deviations (std) below the 
mean UMI counts value (mu_UMI) per sample. 
Peak detection was conducted using the 
scipy.signal.find_peaks function. The number 
of peak detection bins was dynamically 
estimated as [3.322 × log(X)], where X was the 
total number of droplets. The model iterated 
through a series of 20 prominence intervals (0 
to 20) with widths (0 to 10), where peaks detected 
< [mu_UMI − (1.96 × std)] were retained as 
empty droplet peaks. In cases where no empty 
droplet peaks were detected, the empty droplet 
threshold was taken to be < [mu_UMI − (1.96 × 
std)]. The estimated empty droplets matrix was 
then taken into downstream DSB normalization 
in the same way as our previous study (34). 
For the second step of CITE-seq matrix transformation, 
we trained a GMM to model the 
variance of protein expression levels in each 
cell. We used the scikit-learn (v1.1.3) 
sklearn.mixture.GaussianMixture class to fit 20 models 
with an increasing number of cell clusters 
k (between k = 2 and k = 21) to represent 
expression patterns of each protein by cell. The 
optimal model was identified using BIC 
[$\mathrm{BIC}_i = 2L_i + k_i \log(n)$] and AIC (Akaike information 
criterion) [$\mathrm{AIC}_i = 2L_i + 2k_i$] metrics, where k is 
the number of GMM cell protein expression 
clusters, n is the number of cells in the sample, 
and L is the model log likelihood. The models 
with the best performing BIC and AIC scores 
were selected. The mean expression values of 
GMM clusters with lowest expression values 
from each GMM model were interpreted as mean 
background expression per protein. Background-signal 
regression was then carried out using a 
Gaussian linear model (GLM) per protein, constructed 
using the GLM function from statsmodels 
(v0.13.5) on standardized, per-cell background 
scores (BG_score). Per-cell background scores 
were defined by taking Euler's number e to 
the power of each protein background mean 
(mu_bg), divided by e to the power of the protein 
expression in each cell (x), then scaled to a 
distribution between 0 and 1 by subtracting 
the minimum score and subsequently dividing 
by the maximum score. Background scores inversely 
correlate with the magnitude of background 
expression per cell: 
$\mathrm{BG\_score} = [\mathrm{score} - \min(e^{\mu_{bg}}/e^{x})] / \max(e^{\mu_{bg}}/e^{x})$, 
where $\mathrm{score} = e^{\mu_{bg}}/e^{x}$. The per-cell 
background signal regressed counts were used 
for subsequent analyses, interpretation, and 
visualization. Cells comprising the empty droplet 
matrix were removed and were not considered 
for downstream analyses. 
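
The per-cell background score defined above can be written out as a short sketch. It follows the formula as printed, in which the shifted scores are divided by the maximum of the unshifted scores; whether the maximum was meant to be taken before or after subtracting the minimum is not stated, so that choice is an assumption, as is the helper name `background_scores`. The GMM fitting and GLM regression steps are not reproduced here.

```python
import math


def background_scores(expression, mu_bg):
    """Per-cell background score for one protein.

    expression: per-cell expression values (x) for the protein.
    mu_bg: mean background expression for the protein.
    """
    # e^mu_bg / e^x, computed as e^(mu_bg - x) for numerical stability.
    raw = [math.exp(mu_bg - x) for x in expression]
    lowest, highest = min(raw), max(raw)
    # Subtract the minimum score, then divide by the maximum score.
    return [(r - lowest) / highest for r in raw]


# Cells with lower expression relative to background receive higher scores.
example_bg = background_scores([0.0, 1.0, 2.0], mu_bg=1.0)
```

Under this reading the scores start at 0 for the most highly expressing cell and stay strictly below 1 whenever the minimum raw score is positive.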


Integration and batch correction of scRNA-seq 
and CITE-seq datasets 


For integration of newly generated YS scRNA-seq 
data with external datasets, CellRanger 
count was first reapplied for the alignment of 
CS10/CS11 and CS14 embryonic YS scRNA-seq 
data previously acquired (12, 63) (data S1). The 
following steps were then followed for the 
total integrated YS droplet-based scRNA-seq 
dataset. Highly variable gene (HVG) selection 
was performed using the sc.pp.highly_variable_genes 
function (min_mean = 0.001, 
max_mean = 10) for embedding by dispersion. 
Dimensionality reduction and batch correction 
were carried out using the scVI module within 
scvi-tools (v0.19.0) (52) as used in scvi-tools (57) 
(HVG = 7500, dropout_rate = 0.2, n_layer = 2) 
with biological replicate taken as the technical 
covariate. To ensure model performance was 
optimal for each independent analysis, scVI 
was benchmarked against the Python implementation 
of Harmony (64) (Harmonypy v0.0.5) 
at various theta values between 1 and 20. kBET 
(65) and silhouette scores (sklearn.metrics.silhouette_score) 
were computed for each iteration between 
donor covariates and compared with 
the scVI integration. For integration of adult 
scRNA-seq data, publicly available single-cell 
and single-nuclei RNA-seq data of 20 healthy 
adult tissues (data S6 and S7) were integrated 
using scVI (HVG = 1500, layers = 1). Batch correction 
was conducted on donors, single cells 
or single nuclei, data source, number of genes, 
total counts, and percentage of mitochondrial genes 
and ribosomal genes. See data S6 for information 
regarding external scRNA-seq datasets 
that have been incorporated and integrated in 
this study. 
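
To make the silhouette benchmark concrete, the sketch below computes a mean silhouette coefficient over donor labels in an embedding. It is a plain-Python stand-in for a library routine, not the code used in the study; the function name `mean_silhouette`, the use of Euclidean distance, and the toy coordinates are assumptions. When the labels are donors, a lower score is usually read as better mixing of batches.

```python
import math


def mean_silhouette(points, labels):
    """Mean silhouette coefficient of `points` grouped by `labels`."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    groups = {}
    for i, lab in enumerate(labels):
        groups.setdefault(lab, []).append(i)
    if len(groups) < 2:
        raise ValueError("at least two distinct labels are required")

    scores = []
    for i, lab in enumerate(labels):
        own = [j for j in groups[lab] if j != i]
        if not own:
            # Convention: a point alone in its group scores 0.
            scores.append(0.0)
            continue
        # a: mean distance to the other points sharing the label.
        a = sum(dist(points[i], points[j]) for j in own) / len(own)
        # b: smallest mean distance to the points of any other label.
        b = min(
            sum(dist(points[i], points[j]) for j in members) / len(members)
            for other, members in groups.items() if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)


embedding = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
# Donors occupy separate regions: strong batch effect, high silhouette.
separated = mean_silhouette(embedding, ["d1", "d1", "d2", "d2"])
# Donors interleaved across regions: well mixed, low (negative) silhouette.
mixed = mean_silhouette(embedding, ["d1", "d2", "d1", "d2"])
```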

For multimodal integration of CITE-seq datasets, 
we compared the integration of both RNA 
and protein modalities using the totalVI module 
in scvi-tools (v0.19.0) against batch integration 
using only the RNA modality with the scVI module in 
scvi-tools. We applied the sc.pp.highly_variable_genes 
function to the RNA modality (HVG = 4000), 
accounting for FACS sampling and donor technical 
covariates. We then generated a multimodal 
totalVI VAE latent representation following 
the totalVI pipeline (62). To benchmark performance 
between multimodal and single-modality scRNA-seq 
data integration, global silhouette distances 
between Leiden clusters (res = 3) derived from 
the multimodal totalVI VAE latent representation 
were compared against clusters derived 
from the scVI-derived VAE representation as 
described above (HVG = 4000, dropout_rate = 
0.2, n_layer = 2) (fig. S11A). Intersecting cells 
captured between protein and RNA modalities 
(by barcode) were considered in the totalVI integration 
(fig. S11, B to D, and data S4 and S22). 

All scVI VAE and ldVAE models trained on 
the YS and integrated atlases are available on 
our data portal (see Data and materials availability) 
and will facilitate transfer learning for 
future reference mapping of scRNA-seq data with 
single-cell architectural surgery (scArches) (18). 


Clustering and annotation of scRNA-seq 
and CITE-seq data 


Clustering of sCRNA-seq and CITE-seq datasets 
was performed using the Leiden algorithm (66) 
(sc.tl.leiden) with a resolution parameter of 
res = 1.5 (CITE-seq res = 3) on a KNN graph 
(k = 30 for scRNA-seq and & = 15 for CITE-seq) 
unless specified otherwise. To measure the ef- 
fect of decreasing graph complexity on specific 
populations (YS scRNA-seq endoderm and 
iPSC-derived macrophages) and global pop-
ulation specificity and homogeneity in each
independent analysis, the neighborhood graph
complexity parameter (k) was benchmarked at
value intervals between 5 and 50. Benchmarked 
metrics for population specificity included the 
adjusted mutual information score (MI) and 
adjusted Rand index (RAND). Metrics for pop- 
ulation homogeneity included the silhouette 
index (SD) and within-cluster sum of squared 
errors (WSS) (fig. S12, A and B, and data S3
and S7). In cases where datasets are compared 
probabilistically, or where new classifications 
have been made, an implementation of low- 
dimensional ElasticNet regression (EN) (de- 
scribed in the “Cell state predictions using 
probabilistic low-dimensional ElasticNet re- 
gression” section of the Materials and meth- 
ods) was used to first classify individual cells 
where a model-specific decision threshold of 
0.9 was used for classification tasks. Classified
cells inherited labels from the model trained
on YS scRNA-seq data. Clusters were then as- 
signed classes if the majority projected label 
had a label count distribution of >[mean + (1 x 
std)] of label counts per cluster. Resultant cell 
state classifications were further manually 
checked against DEGs identified with the
sc.tl.rank_genes_groups function in Scanpy, which performed


Goh et al., Science 381, eadd7564 (2023) 


a two-sided Wilcoxon rank-sum test for genes 
expressed in >25% of cells, with a log-transformed 
fold change cutoff of 0.25. All P values were ad- 
justed for multiple testing using the Benjamini- 
Hochberg method. Annotation of YS and liver 
CITE-seq data was performed by training an EN 
model using YS and embryonic liver scRNA-seq
datasets as references, respectively. These labels 
were then distributed by majority voting onto 
Leiden clusters derived from CITE-seq data 
(data S9 and S10). The resultant cluster anno- 
tations were validated using the same markers 
identified in matched RNA data and underwent 
additional manual annotation where required. 
For differential expression testing of surface 
proteins in multimodal CITE-seq data, we con-
ducted a one_vs_all DE test using the
vae.differential_expression module within totalVI
(Bayes factor > 3, median LFC > 0.25). Marker 
proteins and corresponding populations were 
subsequently subject to hierarchical grouping 
using the sc.tl.dendrogram functionality with-
in Scanpy (fig. S11, B to D, and data S4, S20,
and S22). Bayesian differential expression 
testing between cell states in the integrated 
12 fetal organ atlas was carried out using a 
scVI integrated latent VAE representation with 
a one_vs_all DE test using the vae.differential_ 
expression module within scVI (v0.19.0) (Bayes 
factor > 3, median LFC > 4). Variation of effect
sizes on state-specific normalized counts be-
tween latent variables in the integrated latent
VAE representation was first modeled. The
posterior likelihood of differential expression 
was attained by repeated one_vs_all sampling of 
the variational distribution. Significant features 
were defined with a likelihood of differential ex- 
pression (Bayes factor) >3 and median LFC >4. 
Bayesian differential expression testing between 
myeloid cell states in the integrated 20 organ 
adult scRNA-seq atlas was carried out as de- 
scribed above (fig. S12C and data S6, S7, and S17). 
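The Benjamini-Hochberg adjustment mentioned above can be sketched in a few lines. This is a generic, minimal implementation of the standard step-up procedure, not the code used in the study (Scanpy applies its own implementation internally); the function name and the example P values are made up for illustration.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Made-up raw P values for four genes.
raw = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(raw)
```

Each adjusted value is the raw P value scaled by the number of tests over its rank, capped so that adjusted values never decrease as raw P values increase.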


Dimensionality reduction and marker 
expression visualization 


For visualization, the UMAP algorithm was 
run using the sc.tl.umap function in Scanpy.
Dot-plots and violin plots were produced in 
Scanpy and all gene expression values dis- 
played were normalized, log-transformed, and 
scaled as described in the preprocessing sec- 
tion unless otherwise stated. Dot plots that 
display data from multiple datasets used in- 
dependent log-normalization, variance scaling, 
and min-max standardization to a distribution 
of 0 to 1 per dataset unless otherwise stated.
FDGs computed with the sc.tl.draw_graph
function in Scanpy using the ForceAtlas2 pa-
rameter were used to infer trajectories. PAGA
graphs were computed on the KNN graphs and over-
laid onto FDGs, where nodes represented the
centroid of each cell state cluster and the thick- 
ness of edges represented the similarity between 
cell states (data S5). 
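The per-dataset min-max standardization described for multi-dataset dot plots can be illustrated with a minimal sketch. The dataset names and values are hypothetical, and mapping a constant vector to zeros is an assumption made here to avoid division by zero.

```python
def min_max_scale(values):
    """Rescale a list of numbers to the range 0 to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Assumption: a constant vector is mapped to all zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical expression values for one gene in two datasets.
datasets = {
    "dataset_a": [0.0, 2.0, 4.0],
    "dataset_b": [10.0, 15.0, 30.0],
}
# Each dataset is scaled independently, so both end up spanning 0 to 1.
scaled = {name: min_max_scale(vals) for name, vals in datasets.items()}
```

Scaling each dataset separately keeps a dataset with a larger dynamic range from dominating the shared color scale.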
Proportion line graphs for specific popula-
tions (e.g., erythroid cells) enriched in specific
genes (e.g., HBZ) using the sc.tl.enrich func- 
tion in Scanpy were produced using Matplotlib 
(v3.6.2). To ensure temporal changes in pop- 
ulation size and background expression were 
accounted for, we segregated our population 
of interest by age and computed changes in 
relative population proportion enriched in each 
gene, only considering cells expressing >0 log-
normalized counts for each gene. Enriched cells
were defined as those with a score >0, where the
score for each gene is its expression minus the mean
expression of a randomly sampled set of 200
reference genes at 50 bins, computed using the
aforementioned enrichment function in Scanpy. Proportions
of enriched cells in each cell type compartment 
were then plotted as a discrete time series across 
gestational age to visualize differential enrich- 
ment of cells expressing the genes of interest. 
Data point sizes represented enriched cell counts. 
To aid interpretation, an ordinal scale of repre- 
sentative cell counts was included as a legend 
in the plots. 

Proportions of specific populations (e.g., mac- 
rophages) enriched in specific gene modules 
(e.g., pre-AGM module) were visualized using 
violin graphs produced using Matplotlib and 
Seaborn (v0.12.1) python libraries. To ensure 
background expression profiles were accounted 
for, we segregated our population of interest 
and computed changes in relative population 
proportion enriched in each gene module. Sig- 
nificant module enrichment was defined as 
described above. Enriched cells from each cell 
type compartment were then graphed across 
organs. Enrichment scores were standardized 
to the median by subtraction of the median 
and subsequent division by MAD. 
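A minimal sketch of the median and MAD standardization described above follows. It assumes the unscaled median absolute deviation (no normal-consistency constant), which the text does not specify; the function name and example scores are hypothetical.

```python
import statistics


def robust_standardize(scores):
    """Subtract the median, then divide by the median absolute deviation."""
    med = statistics.median(scores)
    mad = statistics.median([abs(s - med) for s in scores])
    if mad == 0:
        raise ValueError("MAD is zero; scores cannot be standardized")
    return [(s - med) / mad for s in scores]


# Hypothetical enrichment scores with one extreme value.
scores = [1.0, 2.0, 3.0, 4.0, 100.0]
standardized = robust_standardize(scores)
```

Because both the median and the MAD are insensitive to outliers, the single extreme score does not compress the standardized values of the remaining cells.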


Differential abundance testing and 
FACS correction 


We tested for differential cell-state abundance 
across gestation using the Milo framework 
(67), correcting for CD45 positive and negative 
FACS isolation strategies using a previously 
published technique (48). Where FACS correc-
tion was applied, we calculated a FACS isola-
tion correction factor for each sample s sorted
with gate i as $f_s = \log(p_i S / S_i)$, where $p_i$ is the
true proportion of cells from gate i and S rep-
resents the total number of cells from both gates.
A KNN graph was then constructed from the 
remaining cells using the milopy.core.make_ 
nhoods function (prop = 0.05). Neighborhood 
labels were determined by majority voting of 
cell labels by frequency in each neighborhood 
(>50%). The YS scRNA-seq data was then split 
into five age bins (3 PCW, 4 PCW, 5 PCW, 7 PCW, 
and 8 PCW) and cell counts were modeled as a 
negative binomial generalized linear model 
(NB-GLM) with Benjamini-Hochberg weighted 
correction as previously described (48). Signif- 
icantly differentially abundant neighborhoods 
were detected by SpatialFDR (<0.1, logFC < 0)
for early enriched neighborhoods and (<0.1, 
logFC > 0) for late neighborhoods (data S24). 
Beeswarm plots were generated using the 
ggplot2 library (v3.4.2). Each node represents 
an independent neighborhood of cells derived 
from the KNN graph. The x-axis position of
each node represents the fold-change (positive 
or negative) associated with the distribution 
of age groups present in each neighborhood 
where larger proportions of older groups in 
a given neighborhood encourage a positive
fold change and vice versa. Colored nodes rep- 
resent neighborhoods with significant enrich- 
ment (P < 0.05 spatial FDR) and the intensity 
represents the degree of significance. 
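The FACS isolation correction factor can be written out as a short worked example, taking the factor to be f_s = log(p_i S / S_i). Here S_i is assumed to be the number of cells recovered from gate i in the sample, which the text does not state explicitly; the function name and the numbers are illustrative only.

```python
import math


def facs_correction_factor(true_proportion, total_cells, gate_cells):
    """Correction factor f_s = log(p_i * S / S_i) for one sorted sample.

    true_proportion: p_i, true proportion of cells from gate i
    total_cells:     S, total number of cells from both gates
    gate_cells:      S_i, assumed here to be the cells recovered from gate i
    """
    return math.log(true_proportion * total_cells / gate_cells)


# If sorting reproduces the true proportion, the factor is zero.
balanced = facs_correction_factor(0.5, 1000, 500)
# If gate i is under-sampled relative to its true proportion, it is positive.
under_sampled = facs_correction_factor(0.8, 1000, 500)
```

A factor of this form would typically enter a count model as an offset, although how it was supplied to the NB-GLM is described in the cited technique (48) rather than here.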


Clustered gene set enrichment analysis 


We ranked conserved markers (P < 0.05) between 
the endoderm cell state in YS scRNA-seq data 
against hepatocytes in embryonic liver scRNA- 
seq and endoderm in the mouse gastrulation 
scRNA-seq data using the FindConservedMarkers 
function in Seurat (v3.1) with Bonferroni-corrected 
FDR-adjusted P values. Markers were submitted 
for gene set enrichment ranking and analysis 
using the Enrichr tool as implemented in the 
GSEApy (v1.0) package to query the Gene On-
tology (GO) Biological Process database (GO_BP_2022)
(data S26). Using the enrichR pack-
age (v3.0) in R, enrichment was first computed
by Fisher exact test for randomly sampled genes 
to derive a mean rank and standard deviation 
to estimate background for each ontological 
term accessed. A z-score for deviation of each 
term to its background rank was then used to 
rank output genesets. We derived statistical 
significance (Fisher exact test < 0.05, ranked 
by z-score) for each gene set enrichment and 
performed Markov clustering (MCL) using the 
MCL (v1.0) package in R to derive network neigh-
borhoods based on gene set intersection. Gene set
clusters were annotated using the AutoAnnotate 
function in the RCy3 (v2.16) package and clus- 
ters were ranked by the mean z-score of all gene 
sets within each cluster and manually curated 
based on biological relevance. We used the 
Cytoscape software (v3.9.1) to visualize clusters. 
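The z-score of a term's rank against its background can be sketched as follows. The background ranks stand in for ranks obtained from randomly sampled gene lists; the values, the function name, and the use of the sample standard deviation are assumptions made for illustration.

```python
import statistics


def rank_z_score(observed_rank, background_ranks):
    """Deviation of an observed rank from its background, in SD units."""
    mean = statistics.mean(background_ranks)
    std = statistics.stdev(background_ranks)  # sample SD (an assumption)
    return (observed_rank - mean) / std


# Hypothetical background ranks of one GO term across random gene lists.
background = [10, 12, 14, 16, 18]
# The term ranks 4th for the real marker list, far better than background.
z = rank_z_score(4, background)
```

With lower ranks meaning stronger enrichment, a strongly negative z-score marks a term that ranks much better for the real markers than for random gene lists.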


Cell state predictions using probabilistic 
low-dimensional ElasticNet regression 


Label transfer class assignments and median 
probability of class correspondence between 
gene expression matrices in single-cell datasets 
were carried out using an LR framework, as pre-
viously described (34), using a similar workflow
to the CellTypist tool (68). 

Raw scRNA-seq datasets being compared 
were first concatenated, normalized, and log- 
transformed, as described in preprocessing. 
HVG selection was performed (min_mean = 
0.001, max_mean = 10) for embeddings by dis- 
persion. HVG expression matrices were used 
as training inputs for models unless otherwise 
stated. For models trained in combined low-
dimensional representations, linear VAE la-
tent representations were computed using the
LDVAE module within scvi-tools (hidden layers = 
256, dropout-rate = 0.2, reconstruction-loss = 
negative binomial) with donor, dataset origin, 
and organ information taken as technical covar-
iates. Where PCs were used as input for train-
ing, Harmony batch-corrected PCs (k = 100 PCs)
were computed using Harmonypy (v0.0.9) with tech-
nical covariates as described above. Harmony
runs were iterated through theta = 1:20 and
resultant embeddings benchmarked using kBET
and silhouette scores between technical co- 
variates where a low kBET rejection rate and 
corresponding high silhouette score denoted 
the optimal theta parameter. 

ElasticNet regression (EN) LR models were built 
using the sklearn.linear_model.LogisticRegression 
module in the sklearn package (v0.22). The mod- 
els were trained using either gene expression 
data or scVI batch-corrected low-dimensional
LDVAE representation of the training data with
regularization parameters (l1_ratio and alpha)
tuned using the GridSearchCV function in sklearn 
(v1.1.3). The test grid was designed with five 
l1_ratio intervals (0, 0.2, 0.4, 0.6, 0.8, 1), five alpha 
(inverse of regularization strength) intervals (0.2, 
0.4, 0.6, 0.8, 1) at five train-test splits and three 
repeats for cross-validation. The unweighted 
mean over the weighted mean squared errors 
(MSEs) of each test fold (the cross-validated MSE) 
was used to determine the optimal model. 
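To make the size of this search concrete, the sketch below enumerates the grid values listed above and counts the model fits implied by five train-test splits with three repeats. It only builds the grid and averages per-fold errors; it does not call scikit-learn, and the per-fold error values are invented for illustration.

```python
from itertools import product
from statistics import mean

# Grid values as listed in the text.
l1_ratios = [0, 0.2, 0.4, 0.6, 0.8, 1]
alphas = [0.2, 0.4, 0.6, 0.8, 1]
n_splits, n_repeats = 5, 3

# One candidate model per (l1_ratio, alpha) combination.
param_grid = [
    {"l1_ratio": l1, "alpha": a} for l1, a in product(l1_ratios, alphas)
]
# Every candidate is fitted once per split and per repeat.
n_fits = len(param_grid) * n_splits * n_repeats


def cross_validated_mse(fold_mses):
    """Unweighted mean of the per-fold mean squared errors."""
    return mean(fold_mses)


# Invented per-fold errors for two candidates; the lower mean is preferred.
candidate_errors = {
    "candidate_1": [0.20, 0.25, 0.30],
    "candidate_2": [0.10, 0.15, 0.20],
}
best = min(candidate_errors, key=lambda k: cross_validated_mse(candidate_errors[k]))
```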

The resultant model was used to predict the 
probability of correspondence between trained 
labels and precomputed clusters in the target 
dataset. To ensure that probabilistic outputs 
from LR models remained consistent with ob- 
served neighborhood graph connectivities, the 
median LR predicted probability of training 
label assignment was compared against nor-
malized graph distances between classes com-
puted using the PAGA (tl.paga) module in Scanpy
as described in our previous work (10) (fig.
S13A). Genes predicted to be significantly dis- 
criminatory for each LR model were assessed by 
significance of impact. Features were ranked in
descending order by impact score (e^coefficient
per feature for given intercept). Impact signif- 
icance (P < 0.05) of each gene was computed by 
the survival function (sf) across all gene impact 
scores (fig. S13B). To further verify the specificity 
of the TREM2 macrophage gene expression pro- 
file, the proportion of DEGs overlapping be- 
tween the TREM2 macrophage population and 
other macrophage populations across the 12- 
organ fetal atlas were computed using a two- 
sided Wilcoxon rank-sum test as described in 
the clustering and annotation section (fig. S13, 
C and D, and data S31). 
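The impact-score ranking can be sketched as below: each coefficient is exponentiated and features are sorted in descending order. The text does not say which distribution the survival function was taken from, so this sketch assumes a normal distribution fitted to the impact scores; that choice, the function name, the gene names, and the coefficients are all hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def rank_by_impact(coefficients):
    """Rank features by impact score exp(coefficient), highest first.

    Returns (gene, impact, p) tuples, where p is the survival function of a
    normal distribution fitted to all impact scores (an assumption).
    """
    scores = {gene: math.exp(c) for gene, c in coefficients.items()}
    values = list(scores.values())
    dist = NormalDist(mean(values), stdev(values))
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    # Survival function: probability of a score at least this large.
    return [(gene, s, 1.0 - dist.cdf(s)) for gene, s in ranked]


# Made-up LR coefficients for four placeholder genes.
coefficients = {"gene_a": 2.0, "gene_b": 0.5, "gene_c": 0.0, "gene_d": -1.0}
ranked = rank_by_impact(coefficients)
```

Under this reading, genes whose impact score sits far in the upper tail of the score distribution receive the smallest P values.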

For dataset comparisons across the 12-organ 
fetal atlas where predesignated labels already 
existed in the target dataset, the median prob- 
ability of training label assignment per pre-
designated class was computed. The resultant
LR probabilistic relationships between labels of
the 12-organ atlas were visualized as a heatmap
(figs. S14 and S15 and data S8 to S16). 

For classification tasks, a model-specific de- 
cision threshold of 0.9 was used to determine 
predicted labels. Clusters were then assigned 
classes if the majority projected label had a label 
count distribution of >[mean + (1 x std)] of label 
counts per cluster. Resultant cell state classi- 
fications were further manually checked using 
DEGs. Further assessment of the predicted clus- 
ter labels was carried out by computing the ad- 
justed Rand index and mutual information scores 
from the modules sklearn.metrics.adjusted_ 
rand_score and sklearn.metrics.mutual_info_ 
score between the original cluster labels and 
predicted cluster labels in each dataset. This 
methodology was applied to classify and an- 
notate several external datasets including the 
scRNA-seq human gastrulation data (74), the 
human AGM data (72), the human embryonic 
liver data and human fetal skin data (48), as 
well as the human YS and liver CITE-seq data 
(data S6). 
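One possible reading of the classification rule above is sketched here. A cell receives its top label only if the predicted probability reaches 0.9, and a cluster receives its most frequent label only if that label's count exceeds the mean plus one standard deviation of the label counts in the cluster. Counting over all model labels (zeros included), using the population standard deviation, and treating 0.9 as inclusive are assumptions of this sketch; the function and label names are hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev


def call_cell(probabilities, threshold=0.9):
    """Return the top label if its probability reaches the threshold."""
    label, prob = max(probabilities.items(), key=lambda kv: kv[1])
    return label if prob >= threshold else None


def assign_cluster(cell_calls, all_labels):
    """Assign a cluster label if the top count exceeds mean + 1 SD."""
    counts = Counter(c for c in cell_calls if c is not None)
    per_label = [counts.get(lab, 0) for lab in all_labels]
    top_label, top_count = max(zip(all_labels, per_label), key=lambda kv: kv[1])
    cutoff = mean(per_label) + pstdev(per_label)
    return top_label if top_count > cutoff else None


model_labels = ["macrophage", "monocyte", "mast cell", "erythroid"]
# A cluster dominated by one label, with one unclassified cell.
confident_cluster = ["macrophage"] * 18 + ["monocyte", None]
# A cluster with evenly mixed calls receives no label.
mixed_cluster = [lab for lab in model_labels for _ in range(5)]

confident_label = assign_cluster(confident_cluster, model_labels)
mixed_label = assign_cluster(mixed_cluster, model_labels)
```

Clusters left unlabeled by a rule of this kind would then fall to manual review.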

An implementation of the EN workflow de- 
scribed above, in conjunction with the SAMap 
(self-assembling manifold mapping) workflow 
(v1.0.7) (69), was used to classify and probabil- 
istically compare cell states across the human 
YS scRNA-seq data and the mouse gastrula- 
tion YS data. A gene-gene sequence homology 
graph weighted by human and mouse sequence 
similarity was first constructed using the SAMap
tool. Reciprocal BLAST mapping using the tblastx 
tool between the entire mouse and human tran- 
scriptomes for significant homology (E value < 
10°) was supplied. The resultant SAM object 
returned k = 300 species-stitched PC compo- 
nents for the top 3000 paired genes. These PC 
components were used to train the cross-species 
EN model as a classification task described above 
(data S11). 

LR models and weights trained on the YS and 
integrated fetal atlases are available via our 
interactive web portal in “sav” format (see Data 
and materials availability) and will facilitate fu- 
ture use of our YS atlas for label transfer and to 
rapidly annotate scRNA-seq datasets using the 
Python package CellTypist (v.0.1.9) (68). 


Differential lineage priming and progenitor cell 
fate predictions 


The CellRank package (v1.5.1) was used to de- 
fine and rank fate probabilities of terminal state 
transitions across annotated hematopoietic 
lineages in the YS and iPSC scRNA-seq data- 
sets. In the YS data, cell clusters broadly an- 
notated to be in the myeloid lineage were first 
subsetted from the YS data. After refinement, 
DCs were excluded from this subset. We did 
not identify any DCs in the <CS14 (pre-AGM) 
myeloid lineage. Macrophage trajectory infer- 
ence was then constructed across the myeloid 
subset (fig. S5E). Cells were divided by donors
aged <CS14 and >CS14 (post-AGM) and trajec- 
tory inference recomputed on new embeddings. 
First-order kinetics matrices were imputed for 
each dataset using the pp.moments function 
(n_pcs = 20, n_neighbors = 30) in the scVelo 
package (v0.2.4). A Cytotrace pseudotime for 
state transitions across each dataset was then 
computed to direct graph-edges toward esti- 
mated neighborhood regions of increasing 
differentiation using the Cytotrace kernel pro- 
vided within the CellRank package. The re- 
sultant KNN and Cytotrace pseudotime were 
used to compute a probability transition matrix 
with the compute_transition_matrix command
in Cytotrace. Neighborhoods of cells repre- 
senting terminal states of differentiation were 
identified using true Schur matrix eigen de- 
composition of the transition matrix compute_ 
schur (n_components = 20, method = brandts), 
followed by the compute_macrostates (n_states = 
10) command in Cytotrace. The resultant ter- 
minally differentiated cell states were then 
manually selected if multiple terminal states 
were identified per lineage. Fate absorption 
probabilities were then computed across all 
cells terminating at each prespecified termi-
nal cell state neighborhood using the compute_
absorption_probabilities command in CellRank.
Fate probabilities were then presented as a 
circular plot using the pl.circular_projection function,
with embedding proximity to terminal edges of
the graph representing the fate-transition prob-
ability of a particular cell toward the prespe-
cified terminally differentiated state. HSPC
progenitor population density was then com- 
puted by KDE of a precomputed UMAP high- 
lighting relative probabilities of HSPC lineage 
priming (KDE calculated using the tl.embedding_density
function in Scanpy).
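The idea behind fate absorption probabilities can be shown on a toy Markov chain. The sketch below is conceptual only: it uses a simple fixed-point iteration on a hand-written four-state transition matrix, whereas CellRank works on the cell-level transition matrix with its own solver. All state names and probabilities are invented.

```python
def absorption_probabilities(transition, absorbing, n_iter=1000):
    """Probability of ending in each absorbing state, from every state.

    transition: row-stochastic matrix as a list of lists
    absorbing:  indices of absorbing (terminal) states
    """
    n = len(transition)
    # Absorbing states start, and stay, fully committed to themselves.
    probs = [[1.0 if i == a else 0.0 for a in absorbing] for i in range(n)]
    for _ in range(n_iter):
        updated = []
        for i in range(n):
            if i in absorbing:
                updated.append(probs[i])
            else:
                # One-step lookahead: average over the next state's fate.
                updated.append([
                    sum(transition[i][j] * probs[j][k] for j in range(n))
                    for k in range(len(absorbing))
                ])
        probs = updated
    return probs


# States: 0 progenitor, 1 intermediate, 2 terminal A, 3 terminal B.
P = [
    [0.5, 0.5, 0.0, 0.0],
    [0.0, 0.2, 0.6, 0.2],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
fate = absorption_probabilities(P, absorbing=[2, 3])
```

In this toy chain the progenitor can only progress through the intermediate state, so both share the same fate probabilities of 0.75 toward terminal A and 0.25 toward terminal B.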

For HSPC lineage priming analyses which 
included the respective embryonic erythroid and 
erythroid terminal states, embryonic erythroid 
states were defined as any erythroid cell with a 
HBZ module z-score > 0, and erythroid as any 
erythroid cell with individual module z-score 
of HBA1, HBA2, HBG1, HBG2, HBD > 0. 


pySCENIC for regulon analysis 


The pySCENIC package (v0.9.19) was used to 
identify TFs and their target genes in the YS 
and iPSC scRNA-seq datasets. The ranking data-
base (hg38__refseq-r80__500bp_up_and_100bp_
down_tss.mc9nr.feather), motif annotation data-
base (motifs-v9-nr.hgnc-m0.001-o0.0.tbl) and
list of TFs (lambert2018.txt) were used. An ad-
jacency matrix of TFs and their targets was
generated. TF activity from the AUCell output
was modeled along diffusion pseudotime rank-
ings of each trajectory and used to train a non-
linear generalized additive model (nlGAM)
using the pyGAM.LinearGAM model to iden-
tify TF modules that changed significantly
across each lineage pseudotime. A grid search
over 50 to 200 splines was performed.
Significantly changing TF regulons across pseudo-
time were classified with P < 0.05 and reported
(Fig. 5, D and H, and data S30). Regulon
matrix heatmaps were plotted using the Seaborn 
(v0.12.1) package in Python. Regulon scores were 
variance-scaled and min-max-standardized 
with a distribution of 0 to 1. 


Cell-cell interaction predictions using CellPhoneDB 


To assign putative cell-cell interactions within 
the YS scRNA-seq dataset, we used CellPhoneDB 
(v2.1.2). Log-transformed, normalized, and scaled 
gene expression values for all cell states were 
exported. CellPhoneDB was run using the statis-
tical method with the receptor-ligand database
(v2.0.0) and a significance P cutoff of 0.05 (data
S28 and S29). Outputs were ranked by log-mean
expression for interactions between cell types
of interest in each analysis and plotted as a z-
scored heatmap to show standard deviations
from mean for each receptor-ligand pair. 


Hiplex RNAscope 


Human YS tissue (8 PCW) was frozen in OCT 
compound (Tissue-Tek). 12-plex smFISH was 
performed using the RNAscope HiPlex v2 as- 
say (ACD, Bio-Techne) on three cryosections 
(10 µm) per the manufacturer’s instructions, using
the standard pretreatment for freshly frozen 
samples and permeabilized with Protease III, 
for 15 min at room temperature. The imaging 
cycles, primary probes and label fluorophores 
were: Cycle1_KLRB1_AlexaFluor488, Cycle1_
CD1C_Dylight550, Cycle1_IL7R_Dylight650,
Cycle1_SPINK2_AlexaFluor750, Cycle2_P2RY12_
AlexaFluor488, Cycle2_TNFA_Dylight550, Cycle2_
LGALS3_Dylight650, Cycle2_IL33_AlexaFluor750,
Cycle3_PLVAP_AlexaFluor488, Cycle3_SPINK1_
Dylight550, Cycle3_C1QA_Dylight650, Cycle3_
ACTA2_AlexaFluor750, Cycle4_P2RY12_Opal570,
and Cycle4_IBA1_Cy5. Slides were counterstained 
with DAPI and coverslipped for imaging. 
For protein validation, slides were fixed with 
4% (w/v) PFA for 60 min at room temperature 
and then washed and dehydrated in an ethanol 
gradient (50 to 100%) for 5 min each. Sections 
were treated with Protease III (ACD, Bio-Techne) 
for 15 min at room temperature, then washed 
with PBS before blocking in 10% (v/v) normal 
donkey serum containing 1% (w/v) Triton X-100 
and 0.2% (w/v) gelatin for 60 min at room tem- 
perature. Primary antibodies were incubated 
at 4°C overnight, then washed three times for 
20 min each with a wash buffer [0.1% (w/v) 
Triton X-100 in PBS]. Slides were blocked with 
HRP Block (ACD, Bio-Techne) for 60 min at 
room temperature, and washed with ACD Wash 
Buffer (ACD, Bio-Techne) before addition of sec- 
ondary antibody and incubation for 60 min at 
room temperature. Slides were washed three 
times for 20 min each [0.1% (w/v) Triton X-100 
in PBS]. TSA-Opal570 was added for 10 min at 
room temperature, then washed three times 


18 August 2023 


with ACD Wash Buffer. Slides were counter- 
stained with DAPI and coverslipped for imaging. 

Imaging was performed on a custom two- 
camera spinning disk confocal microscope built 
around a Crest Optics X-light v3 module by 
Cairn Research, a scientific equipment manu- 
facturer. The instrument was controlled using 
the Micro-Manager software (70). All imaging 
was performed in spinning disk confocal mode 
with a 40X water immersion objective [numer- 
ical aperture (NA) 1.15, 180 nm per pixel] and 
a 1.5-µm z-step using a Prime BSI Express (Teledyne
Photometrics) camera.


RNAscope image analysis 


Before each imaging experiment, a slide covered 
in a sparse layer of 0.5-µm TetraSpeck beads was
imaged in all channels. The bead images in 
all channels were then registered against the 
beads in the DAPI channel, and their respec- 
tive affine transforms were saved. 

After imaging, each individual tile was z- 
projected with a maximum intensity projection, 
then the channels were transformed using the 
saved affine transforms. The projected, trans- 
formed tiles were saved back to a temporary di- 
rectory along with a bigstitcher-compatible XML 
file. The BigStitcher software (71) was then used
to stitch the transformed tiles together and the 
final stitched image exported for further analysis. 
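The maximum intensity projection applied to each tile can be sketched with plain nested lists. A real pipeline would operate on image arrays; the function name and the tiny 2 x 2 stack here are illustrative assumptions.

```python
def max_intensity_projection(stack):
    """Collapse a z-stack to one plane by taking the per-pixel maximum.

    stack: list of z-planes, each a list of rows of pixel intensities.
    """
    n_rows, n_cols = len(stack[0]), len(stack[0][0])
    return [
        [max(plane[r][c] for plane in stack) for c in range(n_cols)]
        for r in range(n_rows)
    ]


# Toy z-stack with two planes of 2 x 2 pixels.
z_stack = [
    [[1, 5], [3, 0]],
    [[4, 2], [3, 7]],
]
projected = max_intensity_projection(z_stack)
```

Each output pixel keeps the brightest value found along z, which preserves punctate signal that lies in different focal planes.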

All imaging cycles for a given tissue section 
were registered in two steps. First, we used a fea-
ture registration algorithm implemented in
Python via the OpenCV-contrib library (version
4.3.0) (72) to compute an affine transforma- 
tion of DAPI channel from cycle r > 1 (moving 
image) with respect to DAPI channel from the 
first cycle r = 1 (reference image). Key points 
were detected using the FAST feature detector, 
whose surrounding areas were described using 
the DAISY feature descriptor, and the FLANN- 
based matcher was used to find correspon- 
dences between pairs of key points from reference 
and moving images and filter out unreliable 
points. The remaining key points were pro- 
cessed using the RANSAC-based algorithm that 
aligns them and estimates affine transforma- 
tion parameters with four degrees of freedom. 

For the second registration step, a nonlinear 
registration algorithm based on Farneback 
optical-flow available in Python via OpenCV 
library was used to achieve more accurate reg- 
istration by warping images locally. Specifi- 
cally, local warping was computed using the 
DAPI channel, from cycle r > 1 with respect to 
the corresponding channel of the first round. 
The computational pipeline implementing these 
registration steps was optimized so that it could 
be performed efficiently on large images. The 
corresponding code for feature registration is 
available at https://github.com/BayraktarLab/ 
feature_reg, and the code for optical-flow reg- 
istration is available at https://github.com/ 
BayraktarLab/opt_flow_reg. 
Immunohistochemistry

Formalin-fixed, paraffin-embedded blocks of YS 
4 to 8 PCW, embryonic liver 7 to 8 PCW, and 
healthy adult liver were sectioned at 4-µm thick-
ness onto slides coated with 3-aminopropyltrie-
thoxysilane (APES).

For hematoxylin and eosin staining, slides 
were dewaxed in xylene and rehydrated through 
graded ethanol, as previously described (10).
Rehydrated slides were incubated for 5 min in 
Mayer’s hematoxylin (Dako, Agilent), rinsed in 
tap water, and then differentiated for 2 s in 
acid alcohol before washing in tap water fol- 
lowed by Scott’s tap water substitute (Leica Bio- 
systems). Sections were counterstained in triple 
eosin (Dako, Agilent) for 5 min before being 
rinsed in tap water, dehydrated through graded 
ethanol (70 to 99%), and then placed in xylene 
before mounting with DPX (Dako, Agilent). 

For immunohistochemistry (IHC), dewax-
ing, rehydration, and staining were performed
using the Discovery Ultra autostainer and kits
(Ventana, Roche) following the manufacturer’s
protocols. Primary and secondary antibodies
and their concentrations are listed in data S23.
Slides were counterstained with one drop of 
hematoxylin II (Ventana, Roche) for 8 min, 
rinsed with Reaction Buffer and one drop of 
Bluing reagent (Dako, Agilent) added for 4 min. 
The slide was then rinsed with Reaction Buffer,
before being dehydrated by hand through graded 
ethanol (70 to 99%), placed in xylene and 
mounted with DPX (Dako, Agilent). 

Rabbit polyclonal anti-human alpha-1-feto- 
protein (AFP; Agilent) staining was performed 
by NovoPath, Newcastle upon Tyne NHS Trust, 
using a proprietary method. 

For the Martius Scarlet Blue (MSB) stain,
slides were dewaxed in xylene and rehydrated 
through graded ethanol as previously published 
(10). Rehydrated slides were placed in Bouin’s
fixative (Atom Scientific) for 1 hour at 60°C, 
washed in running water, incubated in Weigert’s 
solution (Atom Scientific) for 10 min and washed 
in water. Slides were differentiated in 0.9% 
ethanol for 1 to 2 s before rinsing in tap water 
followed by Scott’s tap water substitute (Leica 
Biosystems), distilled water and finally 95% 
ethanol. Slides were then incubated stepwise 
in Martius yellow (8 min) (Atom Scientific), 
Brilliant crystal scarlet (6 min) (Atom Scien- 
tific), and 50% (v/v) Methyl blue (2 min) (Atom 
Scientific), washing with distilled water be- 
tween each stain. Slides were washed in tap 
water, rapidly dehydrated (2 to 3 min) through 
graded ethanol (70 to 99%), then placed in 
xylene before mounting with DPX mountant 
(Dako, Agilent). 

All slides were imaged at 20X magnification 
on a NanoZoomer S360 (Hamamatsu) digital 
slide scanner. MSB stained images were de- 
convolved into respective Martius yellow, crys- 
tal scarlet and methyl blue channels using the 
Colour Deconvolution plugin (v1.8) (Masson 
Trichrome) in FIJI with thresholds set using the
Otsu method. Pseudocolors for each deconvolved 
channel were then assigned as in Fig. 2C. 


ASGR1 and CD34 immunofluorescence microscopy


YS sections were baked onto slides for 2 hours 
at 60°C before being dewaxed in xylene and 
rehydrated through graded ethanol as previ-
ously described (10). Slides were washed with
distilled water then placed in a pressure cooker 
with boiling citrate buffer pH 6 [10 mM citric 
acid (Sigma), 0.05% v/v Tween 20 (Sigma) in 
deionized (DI) water] for 2 min for antigen 
retrieval. Slides were then washed for 3 min 
with distilled water followed by 3 min in PBS 
(Sigma). Sections were blocked with 20% (v/v) 
goat serum (R&D Systems) for 45 min at room 
temperature. Primary antibodies were diluted 
in blocking solution (data S23), added to the
sections and incubated for 1 hour at room tem- 
perature. Slides were washed twice for 3 min 
each in a wash buffer [0.1% (w/v) Triton X 
(Sigma) in PBS], then twice for 3 min each in 
PBS. Secondary antibodies (see data S23) were 
diluted in blocking solution, added to the sections,
and incubated for 2 hours at room temperature. 
The wash step was repeated and then 300 nM 
DAPI (Sigma) was added. Slides were incu- 
bated for 5 min before washing with PBS. 
Slides were then mounted with ProLong Dia- 
mond Antifade (Thermofisher) and imaged on a 
Zeiss Axioimager with Zeiss ZEN pro software. 


SMA and LYVE1/CD34 
immunofluorescence microscopy 


PFA-fixed YS was cryoprotected with 10%
sucrose, embedded in gelatin-sucrose solution [7.5%
w/v gelatin (VWR 24350.262), 10% w/v sucrose
(VWR 27478.296), in 0.12 M PBS], frozen at −50°C,
then sectioned at 14 um. Slides were stored at 
-80°C until use, dried for 30 min, then blocked 
with PBS Gelatin Triton [0.2% w/v gelatin, 
0.25% Triton X-100 (Sigma-Aldrich) in PBS] 
for 1 hour. Primary antibodies were diluted 
in blocking solution (data S23), added to the
sections, and incubated overnight. Slides were washed
with PBS three times at 10-min intervals. Sec- 
ondary antibodies were diluted in blocking 
solution and added to sections to incubate for 
2 hours (data S23). Hoechst 33258 (Sigma- 
Aldrich) was added to the secondary antibody 
solution. Sections were washed with PBS three 
times at 10-min intervals, and coverslips were 
mounted with Mowiol (Calbiochem). Sections 
were imaged at 20X magnification on a Leica
DM6000 widefield microscope with MetaMorph
software. Brightness and contrast were adjusted, 
and a scale bar was added with FIJI (73). 


Light-sheet fluorescence microscopy 


Candidate antibodies were screened by im- 
munofluorescence on cryosections obtained 
from OCT-embedded specimens as previously 
described (10, 57). Routine light-sheet immuno-
fluorescence microscopy was then performed
on floating whole-mount YSs as previously 
described, with primary antibody incubation 
reduced to 10 days and secondary reduced to 
2 days, both at 37°C to preserve tissue integ- 
rity. Antibody and other reagents including 
nuclear marker TO-PRO-3 iodide are specified 
in data S23. YSs were embedded in 1.5% agarose 
blocks before solvent-based clearing as previ- 
ously described (57). YS retained its spherical 
shape throughout the procedure. Imaging was 
performed as previously described in dibenzyl 
ether with a Miltenyi Biotec Ultramicroscope 
Blaze (sCMOS camera 5.5MP controlled by In- 
spector Pro 7.3.2 acquisition software), which 
generates light sheets at excitation wavelengths 
of 488, 561, 640, and 785 nm. Objective lenses 
of 4X magnification (MI Plan 4X, NA 0.35) 
and 12X magnification (MI Plan, NA 0.53) 
were used. Imaris (v9.8, BitPlane) was used for 
image conversion, processing, and video pro- 
duction. Blender 3.0 was used to edit videos 
and add text. All raw image data are available 
on request (A.C. and M.H.). 


Statistics and reproducibility 


The number of cells from each cell type in each
de novo single-cell dataset provided in this
manuscript is listed in data S4.
Design of stimulus-responsive two-state hinge proteins

In nature, proteins that switch between two conformations in response to environmental stimuli structurally
transduce biochemical information in a manner analogous to how transistors control information flow 
in computing devices. Designing proteins with two distinct but fully structured conformations is a 
challenge for protein design as it requires sculpting an energy landscape with two distinct minima. Here 
we describe the design of “hinge” proteins that populate one designed state in the absence of ligand 
and a second designed state in the presence of ligand. X-ray crystallography, electron microscopy, 
double electron-electron resonance spectroscopy, and binding measurements demonstrate that despite 
the significant structural differences the two states are designed with atomic level accuracy and that 
the conformational and binding equilibria are closely coupled. 


Although many naturally occurring pro- 
teins adopt single folded states, con- 
formational changes between distinct 
protein states are crucial to the functions 
of enzymes (1, 2), cell receptors (3), and 
molecular motors (4). The extent of these changes 
ranges from small rearrangements of secondary 
structure elements (5, 6) to domain rearrange- 
ments (7) to fold-switching or metamorphic 
proteins (8) that adopt completely different 
structures. In many cases, these conformational 
changes are triggered by “input” stimuli such 
as binding of a target molecule, post transla- 
tional modification, or change in pH. These 
changes in conformation can in turn result in 
“output” actions such as enzyme activation, 
target binding, or oligomerization (9); protein 
conformational changes can thus couple a spe- 
cific input to a specific output. The generation 
of proteins that can switch between two quite 
different structural states is a difficult chal- 
lenge for computational protein design, which 
usually aims to optimize a single, very stable 
conformation to be the global minimum of the 
folding energy landscape (10, 11). Design of 
such proteins requires reframing the design 
paradigm towards optimizing for more than 
one minimum on the energy landscape while 
simultaneously avoiding undesired off-target 
minima (12). Previously, multistate design has 
been used to design proteins that undergo very 
subtle conformational changes (13, 14), cyclic 
peptides that switch conformations based on 
the presence of metal ions (15), and closely re- 
lated sequences that fold into substantially dif- 
ferent conformations (16). Stimulus-responsive 
“LOCKR” (Latching, Orthogonal Cage/Key 
pRotein) proteins have been designed to under- 
go conformational changes upon binding to a 
target peptide or protein (17). The “closed” un- 
bound state of these “switch” proteins is a well- 
defined and fully structured conformation, but 
the “open” bound state is a broad distribution 
of conformations. The LOCKR platform has been 
used to generate biosensors (18, 19), but the lack 
of a defined second state makes it poorly suited 
for mechanical coupling in molecular machines 
or discrete state-based computing systems. 


Hinge Design Method 


We set out to design proteins that can switch 
between two well-defined and fully structured 
conformations. To facilitate experimental char- 
acterization of the conformational change and 
to ensure compatibility with downstream ap- 
plications, we imposed several additional re- 
quirements. First, the conformational change 
between the two states should be large, with 
some inter-residue distances changing by tens 
of angstroms between the two states. Second, 
the conformational change should not require 
global unfolding, which can be very slow. Third, 
neither of the two states should have substan- 
tial exposed patches of hydrophobic residues, 
which can compromise solubility. Fourth, the 
conformational change should be readily cou- 
[...] ties in one protein system is challenging: protein 
conformations that differ considerably typically 
will have different sets of buried hydrophobic 
residues and require substantial structural re- 
arrangements for interconversion. 

We reasoned that these goals could be col- 
lectively achieved with a hinge-like design in 
which two rigid domains move relative to each 
other while remaining individually folded. The 
hinge amplifies small local structural and chem- 
ical changes to achieve large global changes 
whereas the chemical environment for most 
residues remains similar throughout the con- 
formational change, avoiding the need for 
global unfolding. Provided that the two states 
of the hinge bury similar sets of hydrophobic 
residues, the amount of exposed hydrophobic 
surface area can be kept low in both states. 
Designing one of the resulting conformations 
to bind to a target effector couples the confor- 
mational equilibrium with target binding (Fig. 
1A). This design concept has precedent in na- 
ture; for example, bacterial two-component sys- 
tems use binding proteins that undergo hinging 
between two discrete conformations in response 
to ligand binding (20). 

To implement this two-state hinge design 
concept, we took advantage of designed helical 
repeat proteins (DHRs) (21) (Fig. 1, B and C, 
left) and DHR-based junction proteins (22). 
The backbone conformation of the DHR serves 
as the first conformational state of our hinge 
protein (“state X”). To generate a second con- 
formation, a copy of the parent protein is ro- 
tated around a “pivot helix” (Fig. 1, B and C) and 
a new backbone conformation is then created 
by combining the first half of the original pro- 
tein (“domain 1”), the second half of the copy 
(“domain 2”), and either the helix following the 
pivot helix from the original protein or the helix 
preceding the pivot helix from the rotated copy 
(“peptide”). Rosetta FastDesign with backbone 
movement (23, 24) is used to redesign the in- 
terface between the three parts, and the two 
domains are connected into a single chain using 
fragment-based loop closure (21, 25, 26). Using 
a combination of Rosetta two-state design (see 
methods section for details) and ProteinMPNN 
(27) with linked residue identities, a single amino 
acid sequence is generated that is compatible 
with the state X hinge as well as with the state 
Y hinge-peptide complex. AlphaFold2 (AF2) 
(28) with initial guess (29) is then used to pre- 
dict the structure of the hinge with and without 
the effector peptide, allowing for the selection 
of designs that are predicted in the correct 
state X in absence of the peptide and in the 
correct state Y complex in presence of the pep- 
tide. To favor designs that are predominantly in 
the closed state in absence of the peptide (Fig. 
1, A and D), designs are selected only if state X 
has lower energy (computed using Rosetta) 
than state Y in absence of the peptide, and if 
the state Y complex has lower energy than state 
X plus spatially separated peptide. Designs 
are also filtered on standard interface design 
metrics for the bound conformation (see Meth- 
ods for details on filtering) (30). 
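
The two energy-based selection criteria described above can be written as a simple filter. The following Python sketch is illustrative only: the `DesignEnergies` container, its field names, and the toy energy values are hypothetical, and treating the unbound reference as the sum of the state X energy and a separate peptide energy is an assumption. The actual pipeline computes these energies with Rosetta and applies further interface metrics (see Methods).

```python
from dataclasses import dataclass


@dataclass
class DesignEnergies:
    """Hypothetical container; energies are in arbitrary Rosetta-like units."""
    name: str
    state_x: float          # hinge alone, modeled in state X
    state_y: float          # hinge alone, modeled in state Y
    state_y_complex: float  # hinge in state Y bound to the effector peptide
    peptide_alone: float    # spatially separated peptide (assumed additive)


def passes_two_state_filter(d: DesignEnergies) -> bool:
    # Criterion 1: without peptide, state X must be lower in energy than state Y.
    closed_without_peptide = d.state_x < d.state_y
    # Criterion 2: the state Y complex must be lower in energy than
    # state X plus the separated peptide.
    binding_favors_y = d.state_y_complex < d.state_x + d.peptide_alone
    return closed_without_peptide and binding_favors_y


# Toy values chosen only to exercise both criteria.
designs = [
    DesignEnergies("toy_a", -300.0, -290.0, -360.0, -40.0),  # passes both
    DesignEnergies("toy_b", -280.0, -295.0, -350.0, -40.0),  # state Y too stable
    DesignEnergies("toy_c", -300.0, -290.0, -330.0, -40.0),  # complex too weak
]
selected = [d.name for d in designs if passes_two_state_filter(d)]
print(selected)
```

Only the first toy design satisfies both conditions; the other two fail one criterion each.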


Hinges bind effector peptides with 
sub-nanomolar to low micromolar affinities 


We used our hinge design approach to gener- 
ate hinge-peptide pairs that span a wide struc- 
tural space (Figs. 1D and 2A and figs. S1 and 
S2). We experimentally tested multiple rounds 
of designs, using both DHRs (21) and helical 
junctions (22) as input scaffolds, and improv- 
ing individual steps of the design pipeline be- 
tween iterations (see Supplementary Note 1 for 
details on screening and a discussion of suc- 
cess rates and failure modes). We selected hinge 
and GFP-fused peptide designs that were solu- 
ble and interacted with each other as judged 
by size exclusion chromatography (SEC, figs. 
S2 and S3) and performed further character- 
ization by fluorescence polarization (FP). Hinge- 
peptide binding affinities obtained from FP 
titration experiments with constant peptide 
concentration and varying hinge concentra- 
tions ranged from 1 nM to the low µM range 
(Fig. 2B, fig. S4, and table S1). To circumvent 
the bottleneck of finding soluble peptide se- 
quences (see Supplementary Note 1), we also 
sought to design hinges that bind to a given 
target peptide. Starting from design cs201, 
we used a modified version of our design pipe- 
line to redesign the hinge to bind peptides 
cs074B or cs221B, respectively, which have 
similar hydrophobic fingerprints as the original 
target peptide cs201B. This one-sided, two-state 
design approach yielded hinge designs that 
bound strongly to their new target peptide with 
little or no off-target binding (fig. S5). 

Fig. 1. Strategy for designing proteins that can switch between different conformations. 
(A) (Left) Reaction scheme for a protein (blue) that undergoes a conformational change and 
can bind an effector (orange) in one (circle) but not in the other conformational state 
(square). (Right) Energy landscape for the system shown on the left. (B) Schematic 
representation of the hinge design approach. Alpha-helices are represented as circles 
(top view, top) or cylinders (side view, bottom). (From left to right) A previously designed 
repeat protein (gray) serves as the first conformation of the hinge. To generate the second 
conformation, a copy of the repeat protein (green) is moved by shifted alignment along a 
pivot helix, causing a rotation (top and bottom, indicated by the circular arrow) and a 
translation along the helix axis (bottom). The first 4 helices of the original protein form 
domain 1 of the hinge, the last 4 helices of the rotated copy form domain 2, and an additional 
helix is copied over from the original protein to serve as an effector peptide (orange) that 
can bind to this second conformation of the hinge. The two domains of the hinge are connected into one 
continuous chain (blue) using fragment-based loop closure, and a single amino acid sequence is designed to 
be compatible with both conformations. (C) Design steps from (B) illustrated using cartoon representations of an 
exemplary design trajectory. (D) Exemplary design models of a designed hinge protein in state X (left), state 
Y (center), and in state Y bound to an effector peptide (right). Hinge is shown in blue, peptide in orange. 
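
The FP titrations described above are fitted with standard binding isotherms. As a minimal sketch of what such a fit involves, the Python below models 1:1 binding with ligand depletion (the quadratic solution) and recovers a dissociation constant from a synthetic, noise-free titration. The function names, the grid-search fitting procedure, and all numerical values are assumptions for illustration and do not reproduce the authors' fitting procedure.

```python
import math


def fraction_bound(hinge_nM, peptide_nM, kd_nM):
    """Fraction of labeled peptide bound for 1:1 binding, allowing for depletion."""
    total = hinge_nM + peptide_nM + kd_nM
    complex_nM = (total - math.sqrt(total * total - 4.0 * hinge_nM * peptide_nM)) / 2.0
    return complex_nM / peptide_nM


def fit_kd(hinge_series_nM, observed_fraction, peptide_nM):
    """Coarse least-squares KD estimate on a log grid from 0.01 nM to 100 uM."""
    best_kd, best_err = None, float("inf")
    for i in range(701):
        kd = 10 ** (-2 + i / 100.0)
        err = sum(
            (fraction_bound(h, peptide_nM, kd) - f) ** 2
            for h, f in zip(hinge_series_nM, observed_fraction)
        )
        if err < best_err:
            best_kd, best_err = kd, err
    return best_kd


# Synthetic titration: 1 nM labeled peptide, threefold hinge dilution series.
hinge = [0.1 * 3 ** k for k in range(10)]
true_kd = 10.0  # nM, arbitrary illustrative value
data = [fraction_bound(h, 1.0, true_kd) for h in hinge]
estimate = fit_kd(hinge, data, 1.0)
print(f"estimated KD = {estimate:.2f} nM")
```

In a real experiment the measured polarization would be a baseline plus an amplitude multiplied by this bound fraction, so both would need to be fitted alongside KD.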


Effector binding modulates the hinge 
conformational equilibrium 

To characterize the conformational equilib- 
rium of the designed hinges, we introduced 
two surface cysteine residues into the hinge 
protein and covalently labeled them with the 
nitroxide spin label MTSL (31). We then used 
double electron-electron resonance spectros- 
copy (DEER) to determine distance distributions 
between the two spin labels and compared 
these with simulated (32) distance distribu- 
tions based on the state X and state Y design 
models. This experiment was performed on 
two different labeling site pairs for each design: 
one pair where the distance is predicted to 
decrease in the presence of peptide (Fig. 2C 
and fig. S4, C and D) and the other where it is 
predicted to increase (Fig. 2D and fig. S4, C 
and D). In the absence of the peptide, the ob- 
served distance distributions closely matched 
the state X simulations. In all cases the dis- 
tances between the two pairs of probes shifted 
upon addition of peptide to better match the 
state Y simulations, suggesting that addition 
of effector peptide causes the conformational 
equilibrium to shift toward state Y as designed. 
For example, cs074 (site pair 1) showed a clear 
peak between 40 and 50 Å in absence of the pep- 
tide, and a peak between 30 and 40 Å in presence 
of the peptide, and both peaks agree well with 
the corresponding simulations (Fig. 2C, top row). 
In a control experiment using the static parent 
DHR of design cs074, the distance distributions 
with and without peptide were identical and 
matched both the simulation for the parent 
design model, which closely resembles state X, 
and the experimental DEER distance distrib- 
ution for state X of cs074 (fig. S4D). 

We solved crystal structures for two designs, 
cs207 and cs074. For design cs207, crystals 
were obtained from two separate crystalliza- 
tion screens: one screen for the hinge alone, 
and one screen for the hinge in complex with 
the target peptide. In the absence of peptide 
the experimental structure agrees well with the 
state X design model (Fig. 3A), and the struc- 
ture of the hinge-peptide complex agrees well 
with the state Y design model (Fig. 3B). The 
crystal structures of hinge cs207 in both de- 
signed states demonstrate the accuracy with 
which two very different conformational states 
of the same protein can now be designed. For 
design cs074, the crystal structure of the hinge- 
peptide complex agrees well with the corre- 
sponding state Y design model (Fig. 3C). 

One major advantage of de novo designed 
proteins is their robustness to conditions that 
typically destabilize native proteins, such as 
high temperatures, and to structural pertur- 
bations, such as mutations, genetic fusion, and 
incorporation in designed protein assemblies. 
Circular dichroism (CD) melts show that the 
hinges remain folded at high temperatures 
(fig. S6), like the DHRs they were based on 
(21). To test whether our hinges can be incor- 
porated as components of more complex pro- 
tein assemblies without affecting their ability to 
undergo conformational changes, we designed a 
fully structured C3-symmetric protein with three 
hinge arms (Fig. 3D). We used inpainting (33) 
with RoseTTAFold (34) to rigidly connect one 
end of hinge cs221 to a previously validated 
homotrimer (35, 36) and the other end of the 
hinge to a previously validated monomeric 
protein (37). Negative-stain electron microscopy 
(nsEM) with reference-free class averaging shows 
straight arms in absence of peptide and bent 
arms in presence of peptide cs221B, corrobo- 
rating the designed conformational change 
(Fig. 3D and fig. S7). 

Fig. 2. Experimental validation of peptide-binding hinges. (A) Design models of 
hinges (blue) and peptides (orange) in state X (left model) and state Y bound to the 
peptide (right model). Gray shades behind models in state X and Y indicate the 
corresponding states Y and X, respectively. (B) Fluorescence polarization (FP) 
titrations with a constant concentration of TAMRA-labeled peptide (0.1 nM for cs074 
and cs221; 0.5 nM for cs201; 1 nM for cs094, cs207, and js007) and varying 
hinge concentrations. Circles represent data points from four independent 
measurements; lines are fits of standard binding isotherms to all data points; and 
dissociation constants (KD) are obtained from those fits. (C and D) Distance 
distributions between spin labels covalently attached to cysteine side chains. Solid 
lines are obtained from DEER experiments without (blue) or with (orange) an excess 
of peptide, shaded areas are 95% confidence intervals, and dashed lines are 
simulated based on the design models for state X (blue) or the state Y complex 
(orange). For each hinge two different label site pairs were tested, one in which the 
distance was expected to decrease with peptide binding (C) and one in which the 
distance was expected to increase upon peptide binding (D). Chemically synthesized 
peptides were used for all measurements except for cs074 site pair 1, for which 
sfGFP-peptide fusion was used. For design cs094, the residual state X peak in 
presence of the peptide likely reflects incomplete binding due to weak binding 
affinity or insufficient peptide concentration. 

A critical feature of two-state switches in bi- 
ology and technology is the coupling between 
the state control mechanism and the popula- 
tions of the two states. To quantitatively in- 
vestigate the thermodynamics and kinetics of 
the effector-induced switching between the two 
states of our designed hinges, we used Förster 
resonance energy transfer (FRET). To increase 
both the absolute distance from N- to C-terminus 
and the change in termini distance between 
the two conformational states, we took advan- 
tage of the extensibility of repeat proteins and 


18 August 2023 


extended hinges cs201, cs221, and cs074 by 
1 to 2 helices on their N and C termini, yielding 
cs201F, cs221F, and cs074F, respectively (Fig. 4A, 
first column). Single cysteines were introduced 
in helical regions near the termini of the ex- 
tended hinges and stochastically labeled with 
an equal mixture of donor and acceptor dyes. 
For cs201F, the dye distance is below R0 in state 
X and above R0 in state Y, and hence acceptor 
emission upon donor excitation decreases upon 
addition of peptide cs201B (Fig. 4A, second 
column). For hinges cs074F and cs221F the 
distance between the label sites is above the R0 
of the dye pair in state X and below R0 in state 
Y and hence acceptor emission upon donor 
excitation increases upon addition of the cor- 
responding peptides cs074B and cs221B, res- 
pectively (Fig. 4A, second column). We used 
labeled, extended DHR82, the parent protein 
for cs074F, as a static control, and observed 
fluorescence spectra comparable to cs074F but 
observed no change in fluorescence upon ad- 
dition of the peptide (fig. S8, A and B). To test 
specificity of our hinge-peptide pairs, we per- 
formed pairwise titrations of all three labeled 
hinges at 2 nM with all three target peptides 
at varying concentrations. The on-target titra- 
tions had sigmoidal transitions that can be 
fitted with standard binding isotherms (Fig. 4A, 
third column; fig. S8C), whereas the off-target titra- 
tions for cs201F and cs221F show flat lines, in- 
dicating no conformational change of these 
hinges upon addition of off-target peptides at 
micromolar concentrations. cs074F showed 
weak off-target binding that was three orders 
of magnitude weaker for cs201B and two orders 
of magnitude weaker for cs221B compared with 
the on-target interaction for cs074B. cs201F and 
cs221F are thus orthogonal from the nanomolar 
to the micromolar range, and the set of cs201F, 
cs221F, and cs074F is orthogonal over two orders 
of magnitude of effector concentration. 

Fig. 3. Close agreement between crystal structures and design models for 
both designed states. (A) Design model (blue) of hinge cs207 in state X overlaid 
with crystal structure (gray) of hinge cs207 crystallized without peptide. Right panel 
shows a close-up view of the side chains in the interface between the two hinge 
domains (side chain colors follow a spectrum from blue to red from N- to 
C-terminus). (B) Design model (hinge in blue, peptide in orange) of the cs207 state 
Y hinge-peptide complex overlaid with crystal structure (gray) of hinge cs207 co- 
crystallized with peptide cs207B. Right panel shows a close-up view of 
the side chains in the interface between hinge and peptide [hinge side chain colors 
match the corresponding side chains in (A) and peptide side chains are shown in 
dark gray]. (C) Design model (hinge in blue, peptide in orange) of hinge cs074 in 
state Y overlaid with crystal structure (gray) of hinge cs074 co-crystallized with 
peptide cs207B. Representative electron densities for all crystal structures are 
shown in fig. S19. RMSD values between design model and experimental structure 
are given in table S4. (D) (Left) Components for design of a C3-symmetric 
homotrimer with three cs221 hinge arms. (Center) Design model of the hinge-armed 
trimer in state X (top) and in state Y (bottom). (Right) nsEM class averages of the 
trimer in absence (top) and in presence (bottom) of peptide cs221B. 

Association kinetics for the on-target in- 
teractions measured using constant concen- 
trations of labeled hinge and varying excess 
concentrations of peptide are well fit by single 
exponentials (Fig. 4A, fourth column, and fig. 
S9). The apparent rate constants increase lin- 
early with increasing peptide concentration, 
exhibiting standard pseudo-first order kinetics 
for bimolecular reactions (Fig. 4A, fifth column, 
and fig. S9). We analyzed these data using a 
model comprising three states (X, Y, Y+peptide) 
and four rate constants (Fig. 4B). The kinetic 
measurements using the FRET system follow 
the decrease in state X over time (d[X]/dt) upon 
the addition of peptide. The observed pseudo- 
first order behavior (Fig. 4A, fifth column) indi- 
cates that the conformational change happens 
on a timescale that is faster than that of the 
observed binding and can be treated as a fast 
pre-equilibrium (Supplementary Note 2). The 
slopes of the linear pseudo-first order fits (kon) 
can thus be interpreted as the product of the 
microscopic association rate constant and the frac- 
tional population of state Y in absence of the 
peptide (F_Y = [Y]/([X]+[Y]), see Supplementary 
Note 2). FP-based titrations and kinetic charac- 
terization using the unlabeled extended hinge 
cs074F in excess over the TAMRA (tetrame- 
thylrhodamine)-labeled peptide cs074B agree 
well with the corresponding FRET experiments, 
further supporting the pre-equilibrium model 
(Fig. 4C and fig. S9). FP kinetics experiments 
for other hinge designs also follow pseudo- 
first order behavior with kon values ranging 
from 2.5 x 10? M^-1 s^-1 to 7.8 x 10* M^-1 s^-1 (figs. 
S4B and S10). To study the reversibility of 
hinge conformational changes, we started with 
30 nM of FRET-labeled hinge cs201F, added 
200 nM peptide to drive the conformational 
change, and then added excess unlabeled hinge 
cs201 to compete away the peptide (Fig. 4D). 
The FRET signal decreased upon addition of 
the peptide, consistent with conformational 
change from state X to state Y, and then returned 
to nearly the original level upon addition of un- 
labeled hinge, indicating that the hinge confor- 
mational change is fully reversible. 
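
The pseudo-first order analysis described above amounts to a straight-line fit of apparent rate constants against peptide concentration. The Python sketch below assumes the textbook relation k_obs = kon[P] + koff for a bimolecular reaction with peptide in excess; the interpretation of the intercept as koff, the function name `linear_fit`, and every numerical value are assumptions used only to illustrate the procedure on synthetic data.

```python
def linear_fit(x, y):
    """Ordinary least-squares line through (x, y); returns (slope, intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic, noise-free data (illustrative values, not measurements).
k_on_true = 5.0e4    # M^-1 s^-1
k_off_true = 1.0e-3  # s^-1
peptide_M = [50e-9, 100e-9, 200e-9, 400e-9, 800e-9]
k_obs = [k_on_true * p + k_off_true for p in peptide_M]

# The slope estimates kon and the intercept estimates koff.
k_on_fit, k_off_fit = linear_fit(peptide_M, k_obs)
print(f"kon = {k_on_fit:.3g} M^-1 s^-1, koff = {k_off_fit:.3g} s^-1")
```

Under the fast pre-equilibrium model, the fitted slope would in turn be read as the microscopic association rate constant multiplied by the fraction of hinge in state Y.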

To explore whether peptide-responsive hinges 
could be turned into protein-responsive hinges, 
we used inpainting with RoseTTAFold to add 
two additional helices to a validated effector 
peptide, resulting in fully structured 3-helix 
bundles (3hb). For nine of our validated hinges 
we designed and experimentally characterized 
these effector proteins using SEC (Fig. 4E and 
figs. S11A and S12). Hinge-3hb binding was 
tested qualitatively by SEC and, for hinges which 
had a corresponding FRET construct, quantita- 
tively with the FRET-labeled variant, and DEER 
was used in addition to FRET to confirm that 
3hb binding caused the same conformational 
change as effector peptide binding (Fig. 4E, 
bottom, and fig. S11). The affinity of 3hb05 to 
cs074F was similar to the affinity observed for 
the original peptide cs074B (Fig. 4E), whereas 
3hb21 bound its target hinge cs221F significantly 
tighter than the original peptide cs221B (fig. 
S13). The 3hb approach was able to rescue 
designs for which the peptide alone or the hinge- 
peptide complex had shown the tendency to 
form higher-order oligomers (fig. S12). For two 
designs, 3hb05 and 3hb12, we obtained crystal 
structures that agreed well with the design mod- 
els, indicating that the three-helix bundles are 
fully structured in isolation (Fig. 4E, top right, 
and fig. S14). 

Fig. 4. Quantitative analysis of conformational changes in designed 
hinge proteins. (A) FRET-based characterization of three extended hinges. 
(From left to right) cylindrical representation of extended hinges (blue) and their 
corresponding target peptides (green, cs201B; pink, cs221B; orange, cs074B) 
with red stars indicating attachment sites for fluorescent dyes; fluorescence 
spectra (excitation at 520 nm) of labeled hinge without (blue) or with (green/ 
pink/orange) target peptide; FRET-based binding titrations (excitation 520 nm, 
emission 665 nm) at 2 nM labeled hinge and varying peptide concentrations 
fitted with standard binding isotherms (solid lines); time course after mixing 
2 nM (cs201F, cs074F) or 5 nM (cs221F) labeled hinge and 100 nM peptide fitted 
with a single-exponential equation (black line); and apparent rate constants 
obtained from single-exponential kinetic fits plotted against absolute peptide 
concentrations (circles) and fitted with a linear equation (black line). Dotted lines 
in spectra indicate acceptor and donor emission peaks. (B) Kinetic model 
describing the coupling of the conformational equilibrium to the binding 
equilibrium. X and Y, hinge in state X and Y, respectively; P, peptide; YP, peptide 
bound to hinge in state Y. ky, k-y, ko, and k_p are the microscopic rate constants. 
(C) FP characterization of unlabeled extended hinge cs074F. (From left to right) 


The conformational pre-equilibrium controls 
effector binding 


To test the effect of the conformational pre- 
equilibrium on effector binding, we introduced 
disulfide “staples” that lock the hinge in one 
conformation. Using FP we analyzed peptide 
binding to stapled versions of hinge cs221 (Fig. 
5, A and B). The variant that forms a disulfide 
bond in state X (“locked X”) showed only weak 
residual binding, likely due to a small fraction 
of hinges not forming the disulfide (Fig. 5A). 
Upon addition of the reducing agent dithio- 
threitol (DTT) to break the disulfide, peptide 
binding was fully restored, making this hinge 
variant a red/ox-dependent peptide binder that 
binds the effector peptide under reducing— 
but not oxidizing—conditions. The association 
rate for the locked Y variant was 200 times 
higher than that for the original hinge without 
disulfides (Fig. 5B and fig. S15, A and B; de- 
spite this increase the overall binding affinity 
was weaker, suggesting the disulfide may lock 
the hinge in a slightly perturbed version of state 
Y). Using the pre-equilibrium model described 
above, the observed association rates provide 
an estimate of the fraction of hinge that is in 
state Y in absence of the peptide: a 200-fold 
higher observed on rate for the locked Y variant 
indicates a 200-fold higher fraction of hinge 
in state Y compared with the original hinge. 
Assuming that the locked Y variant is 100% in 
state Y and assuming that the microscopic rate 
constant ky is identical for the locked Y hinge 
and state Y of the original hinge, this would in- 
dicate that the original hinge is 99.5% in state X 
and 0.5% in state Y at equilibrium. 
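
The population estimate above follows from simple arithmetic under the stated assumptions. The Python sketch below reproduces it and, as an additional illustration not taken from the text, converts the populations into a free energy difference for a two-state equilibrium. The function names and the temperature of 25 °C are assumptions.

```python
import math


def state_y_fraction(fold_increase_when_locked, locked_fraction=1.0):
    """Estimate the state Y fraction of the unlocked hinge.

    Assumes the observed on rate equals the microscopic association rate
    constant times the state Y fraction, with the same microscopic constant
    for the locked Y hinge and for state Y of the original hinge.
    """
    return locked_fraction / fold_increase_when_locked


def free_energy_gap_kcal(fraction_y, temperature_K=298.15):
    """G(Y) - G(X) in kcal/mol for a two-state equilibrium (assumed 25 C)."""
    gas_constant = 1.987204e-3  # kcal mol^-1 K^-1
    return -gas_constant * temperature_K * math.log(fraction_y / (1.0 - fraction_y))


f_y = state_y_fraction(200.0)  # 200-fold faster association when locked in state Y
f_x = 1.0 - f_y
print(f"state Y: {f_y:.1%}, state X: {f_x:.1%}")
print(f"G(Y) - G(X) is roughly {free_energy_gap_kcal(f_y):.1f} kcal/mol")
```

If the locked Y variant were not fully in state Y, `locked_fraction` could be lowered, which scales the estimate proportionally.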

Having established the edge cases of locked 
state X and locked state Y, we sought to tune 
the pre-equilibrium by introducing single point 
mutations expected to specifically stabilize 
one state over the other while not directly af- 
fecting the peptide-binding interface. We used 
ProteinMPNN to generate consensus sequences 
(38) for each state and identified non-interface 
positions with residue preferences 
that differed between the two states (Fig. 
5C and fig. S16A). We experimentally tested 
individual protein variants carrying substitu- 
tions expected to stabilize one state over the 
other without disrupting either conformation, 
as evaluated by AF2 predictions. Consistent 
with coupling of the conformational and bind- 
ing equilibria, substitutions based on state X 
consensus sequences led to weaker peptide 
binding, and those based on state Y consensus 
sequences led to stronger binding (Fig. 5C 
and fig. S15C). The substitutions that stabilized 
state Y showed accelerated association kinetics 
(Fig. 5C and fig. S17), consistent with our kinetic 
model (Fig. 4B and fig. S16, B and C; Supple- 
mentary Note 2): the mutations effectively shift 
the conformational pre-equilibrium toward 
state Y, increasing the on rates. This close 
coupling of the conformational equilibrium 
with association kinetics further supports 
the model outlined in Fig. 4B, and the fine 
tunability should be useful in downstream 
applications. 

Fig. 5. Controlling the conformational pre-equilibrium affects peptide binding. (A) (Left) Schematic 
representation of a hinge containing two cysteine residues that can form a disulfide bond in state X but not 
in state Y, effectively locking the hinge in state X under oxidizing conditions. Upon addition of reducing 
agent DTT the disulfide bond is broken and the conformational equilibrium is restored. (Right) FP-based 
titration of 1 nM TAMRA-labeled peptide and a hinge with state X disulfide (red, orange) or the parent hinge 
without cysteines (blue, green) under oxidizing (blue, red) or reducing (green, orange) conditions. 
(B) (Top left) Schematic representation of a hinge that is disulfide-locked in state Y; (Top right) time course 
after mixing 2 nM TAMRA-labeled peptide and 50 nM locked hinge (red) or original hinge without cysteines 
(blue) fitted with a single-exponential equation (black line); (Bottom) apparent rate constants obtained from 
single-exponential kinetic fits plotted against absolute hinge concentrations (circles) and fitted with a linear 
equation (black line). (C) Tuning the pre-equilibrium with point mutations. (Top left) Models of hinge cs221 in 
both states highlighting positions of point mutations. (Top right) Dissociation constants (KD) and observed 
binding rate constants (kon). (Bottom left) FP-based titration of 0.1 nM (yellow, green, blue) or 1 nM (pink, 
red) TAMRA-labeled peptide cs221B and varying concentrations of hinge variants containing one or two point 
mutations. (Bottom center) Apparent rate constants obtained from single-exponential kinetic fits plotted 
against absolute hinge concentrations (circles) and fitted with a linear equation (black line). (Bottom right) 
DEER distance distribution for the double mutant cs221-V111L-A114T in absence of peptide (gray) in 
comparison to the original cs221 with (orange) and without (blue) peptide. Vertical lines serve as a guide 
to the eye indicating state X and state Y distances. 

The state Y-stabilizing double mutant cs221_ 
V111L_A114T has an on rate that is 22 times 
higher than that of the original cs221, suggesting 
that the occupancy of state Y in cs221_V111L_ 
A114T is 22 times higher in the absence of 
peptide. Distance distributions obtained from 
DEER measurements on site pair 2 of the 
double mutant cs221_V111L_A114T in absence 
of the peptide indeed showed an additional 
peak at a distance closely matching state Y 
(Fig. 5C and fig. S18). DEER measurements 
on site pair 1 of the double mutant showed a 
broader distribution with occupancy in the 
region corresponding to state Y (Fig. 5C and 
fig. S18). Measurements in the presence of 
the peptide were virtually indistinguishable 
from the original cs221 (fig. S18). The double 
mutant thus populates two distinct states in 
the absence of the effector, and collapses to 
one state upon effector addition (Fig. 5C and 
fig. S18). The observation of a significant state 
Y population at equilibrium in the absence 
of the peptide as predicted based on the ki- 
netic measurements further corroborates that 
the mutations affect the conformational pre- 
equilibrium and provides strong support for 
our quantitative two-state model of the kinetics 
and thermodynamics of the designed hinge- 
effector systems. 
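
The coupling between the conformational and binding equilibria invoked in this section can be written out for the three-state scheme of Fig. 4B. The following is a sketch under the fast pre-equilibrium assumption; the symbols $K_{D,Y}$ (intrinsic dissociation constant of state Y), $K_{D,\mathrm{app}}$ (apparent dissociation constant), and $k_a$ (microscopic association rate constant) are labels introduced here for illustration and are not taken from the original text.

```latex
\begin{align}
F_Y &= \frac{[Y]}{[X]+[Y]}, \qquad K_{D,Y} = \frac{[Y][P]}{[YP]} \\
K_{D,\mathrm{app}} &= \frac{\left([X]+[Y]\right)[P]}{[YP]} = \frac{K_{D,Y}}{F_Y} \\
k_{\mathrm{on}} &= k_a \, F_Y
\end{align}
```

Under these relations, a mutation that raises $F_Y$ 22-fold would be expected to raise the observed on rate 22-fold and to lower the apparent dissociation constant by the same factor, provided that $K_{D,Y}$ and $k_a$ are unchanged. The locked Y variant, which bound more weakly despite faster association, shows that this proviso does not always hold.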


Conclusion 


Our hinge design method generates proteins 
that populate two well-defined and structured 
conformational states rather than adopting a 
heterogeneous mixture of structures and should 
be broadly applicable to design of functional 
proteins. Like transistors in electronic circuits, 
we can couple the switches to external outputs 
and inputs to create sensing devices and in- 
corporate them into larger protein systems 
to address a wide range of outstanding design 
challenges. Hinges containing a disulfide that 
locks them in state X couple the input “red/ox 
state” to the output “target binding,” where 
the target can be a peptide or a protein, and 
our FRET-labeled hinges couple the input 
“target binding” to the output “FRET signal.” 
Our approach can be readily extended such 
that state switching is driven by naturally oc- 
curring rather than designed peptides: recently 
designed extended peptide binding proteins 
(39) resemble the state X of our hinges, and 
recent designs that bind glucagon, secretin, or 
neuropeptide Y (40) resemble the state Y of our 
hinges. Hinges based on such designs could 
thus provide new routes to applications in sens- 
ing and detection. 
Stimulus-responsive protein assemblies that 
switch between two well-defined shapes or 
oligomeric states in the presence of an effector 
can now be built by incorporating the hinges 
as modular building blocks, which was not 
possible with the previous LOCKR switches 
as one of the LOCKR states is disordered. In- 
stalling enzymatic sites in hinges such that 
substrate binding favors one state and product 
release favors the other state should enable 
fuel-driven conformational cycling, a crucial 
step toward the de novo design of molecular 
motors. More generally, the ability to design 
two-state systems, and the designed two-state 
switches presented here, should enable protein 
design to go beyond static structures to more 
complex multistate assemblies and machines. 
BIOPHYSICS 


Alcanivorax borkumensis biofilms enhance oil 
degradation by interfacial tubulation 


M. Prasad, N. Obana, S.-Z. Lin, S. Zhao, K. Sakai, C. Blanch-Mercader, 
J. Prost, N. Nomura, J.-F. Rupprecht, J. Fattaccioli, A. S. Utada 


During the consumption of alkanes, Alcanivorax borkumensis will form a biofilm around an oil droplet, 
but the role this plays during degradation remains unclear. We identified a shift in biofilm morphology 
that depends on adaptation to oil consumption: Longer exposure leads to the appearance of dendritic 
biofilms optimized for oil consumption effected through tubulation of the interface. In situ microfluidic 
tracking enabled us to correlate tubulation to localized defects in the interfacial cell ordering. We 
demonstrate control over droplet deformation by using confinement to position defects, inducing 
dimpling in the droplets. We developed a model that elucidates biofilm morphology, linking tubulation 
to decreased interfacial tension and increased cell hydrophobicity. 


Obligately hydrocarbonoclastic bacteria 
(OHCB) are a group of cosmopolitan 
marine bacteria with an unusual ecology: 
They can survive by consuming hydrocarbons 
as a sole carbon and energy 
source (1). These metabolic specialists are found 
at very low densities because of the lack of 
hydrocarbons but are thought to play a global 
role in metabolizing naturally occurring al- 
kanes (2) in the ocean. However, they become the 
dominant bacteria, out-competing generalists, 
at the site of oil spills (1, 3, 4). OHCB are thought 
to degrade a substantial fraction of spilled oil 
worldwide (1, 5), which has generated interest 
for their potential as agents of bioremediation 
(5-9). 

Alcanivorax borkumensis SK2 is an aerobic 
and rod-shaped OHCB (10) that is often used 
as a model organism for its prevalence (1, 2) and 
its genetic tractability (6, 11). Like most bacteria, 
it transitions between planktonic and 
biofilm lifestyles, which is now recognized as 
integral to bacterial biology (12). Biofilms are 
often dense three-dimensional communities encased 
in self-secreted extracellular polymeric 
substances, which adhere them to surfaces. 
Biofilms begin with the colonization of a sur- 
face. This can lead to high two-dimensional cell 
densities, which in the case of rod-shaped bac- 
teria, induces nematic liquid crystal order. Bac- 
teria have been shown to escape confinement 
in dense packings (13, 14) by leveraging regions 
where the nematic order is undefined, called 
topological defects (15). Unlike most bacteria, 
because A. borkumensis forms biofilms on a 
liquid, it is not clear how the interfacial fluid- 
ity affects cell packing and biofilm morphol- 
ogy formation during oil consumption. 

Most knowledge of bacteria-mediated oil deg- 
radation comes from chemical, genetic, and 
metagenomic analysis of ocean samples (7, 16, 17) 
and microcosm tests that used crude oil and sea 
water (18-20). Recent work has begun to clarify 
the important initial step of bacterial colonization 
(21) and to characterize biofilm formation 
(22-25) on oil drops in the size range com- 
monly found dispersed in the ocean after an oil 
spill. Biofilms can deform oil drops (8, 24, 25) 
and increase their hydrodynamic drag (23), po- 
tentially mediating the formation of organic-oil 
aggregates. These aggregates, also known as 
marine oil snow, have been identified as a mech- 
anism by which large quantities of partially 
biodegraded oil were transported to the seafloor 
in the aftermath of the Deepwater Horizon 
disaster (4, 23, 26). However, the mechanism 
of biofilm formation, which depends on the 
interfacial properties of individual cells, and 
its relation to oil degradation remain 
unclear (18). 


Experimental setup and microfluidic device 


To address these questions, we developed a 
microfluidic device that allows the trapping 
and real-time imaging of numerous bacteria- 
covered oil droplets. This platform allows us to 
ment starting from individual bacteria to 
the complete consumption of oil droplets. 

In the ocean, A. borkumensis subsists pri- 
marily on naturally occurring organic acids and 
alkanes (2); however, during oil spills, it blooms 
to exploit the hydrocarbons in the crude oil 
mélange. To study the biofilm dynamics as 
A. borkumensis adapts (27) to using solely al- 
kanes, we initially cultivated using pyruvate 
and then switched to an artificial seawater me- 
dium supplemented with hexadecane (C16) 
(fig. S1A). We sampled bacteria from liquid 
cultures cultivated up to 5 days by harvest- 
ing the cells and generating cell-laden oil 
microdroplets with fresh C16, which we incu- 
bated in individual microfluidic traps (supple- 
mentary text S1). Our device, which is highly 
permeable to oxygen, facilitates in situ cul- 
turing, enabling the longitudinal investigation 
of biofilm development simultaneously on nu- 
merous droplets (Fig. 1A and fig. S2A). Trapped 
drops initially had approximately 20 to 50 cells 
attached and reached confluency in ~12 hours, 
which we define as t₀ (fig. S3A). 


Bacteria exhibit two distinct phenotypes and 
consumption rates 


Testing different liquid cultures at 1 and 5 days 
revealed different phenotypes that manifested 
vastly different biofilm morphologies and oil 
consumption rates. When sampled after 1 day, 
A. borkumensis formed a spherical biofilm (SB) 
that grew outward from the oil, and the oil 
droplet remained mostly spherical as it was 
consumed (Fig. 1B, fig. S3B, and movie S1). 
By contrast, bacteria sampled from a 5-day-old 
culture can develop into thin biofilms with 
local nematic ordering of the interfacial cells. 
These biofilms, which we call dendritic biofilms 
(DB), eventually buckle and then tubulate 
(24, 25) the droplet interface (Fig. 1C and supplementary 
text S2). The biofilm buckles the interface to 
accommodate a population that is continuously 
increasing in number. The magnitude of the 
deformations grows as the oil is consumed, 
which ultimately shreds the droplets into tiny 
fragments (fig. S3C and movie S2). 

To quantify the oil consumption rate for 
each phenotype, we measured the oil volume 
(V) over time using confocal microscopy. SBs 
consumed >90% of their oil droplet volume 
in ~72 hours, whereas DBs achieved the same 
level of consumption in ~20 hours. We found 
that the oil volume of SBs decreases as a poly- 
nomial function of time, whereas for DBs, the 
decay is much faster (Fig. 1, D and E, and fig. S3, 
D to F). In both cases, the decrease is consistent 
with a model of oil consumption effected ex- 
clusively by bacteria at the interface (supple- 
mentary text S3). 

The good agreement between our analytical 
models and data allows us to estimate the 
single-cell consumption rates of oil by the SB and 

Fig. 1. Spherical and dendritic biofilm phenotypes on oil drops in a microfluidic trap. (A) (Left) 
Schematic of the microfluidic oil-drop trap device, showing media-filled channels. Oil drops are 
trapped in the raised circular regions. Inlets on either side of the drop chamber connect to 
reservoirs that provide a gentle flow of media through the trap chamber. The white circles indicate 
pillars. Scale bar, 200 μm. (Middle) Schematic cross-section of an individual trap. (Right) 
Bright-field image of a representative drop in a trap. The trap is outlined with a white dashed line, 
and the pocket is colored orange as a guide for the eye. Scale bar, 20 μm. (B) Representative 
time-lapse sequence of confocal images showing the development of the SB phenotype. As the 
biofilm (green) grows, the oil droplet (central void) shrinks. Scale bar, 10 μm. (C) Maximum 
intensity projection confocal images showing the development of the DB phenotype on 
representative drops. Local nematic order is present at confluence (~0 hours), buckling 
(0.1 to 0.5 hours) and protrusions (2 to 4 hours) appear later, and large-scale remodeling of the 
interface leading to the formation of tubes occurs much later (16 hours). Scale bar, 10 μm. 
(D and E) Normalized surface area (S*) and volume (V*) of oil drops as a function of time for 
(open circles) SBs and (stars) DBs. Solid lines and shaded regions indicate mean ± SD 
(n_SB = 11 drops, and n_DB = 12 drops; representative experiment from three or more independent 
tests) (additional data are provided in fig. S3). Solid symbols indicate measurements, and open 
symbols indicate values calculated from measured quantities. The dashed and dotted lines are best 
fits of our analytical models of oil degradation for the phenotypes. [(D), inset] Normalized drop 
radius (R*) was used to estimate S* and V* for SBs. [(E), inset] S*/V* as a function of time. 
R, S, and V are normalized by their respective initial values. We used eqs. S5, S6, S9, and S11 
(supplementary text S3) to fit R*, S*, and V* for SBs and DBs, respectively. 


DB cells as 0.7 and 0.8 fl/hour, respectively. 
This small difference in consumption rate is 
consistent with their similar division times 
(fig. S1D). For comparison, the volume of a 
single cell is ~1 fl, meaning that these bacteria 
consume a volume of oil close to their own, 
every hour. Despite the similarity in consump- 
tion rates on a per-cell basis, the normalized 
surface-to-volume ratio (S*/V*) shows that DBs 
are considerably more efficient: S*/V* doubles 
in 72 hours for SBs, whereas it diverges in less 
than 24 hours for DBs (Fig. 1E, inset). The S*/V* 
ratio provides a means for comparing the rel- 
ative efficiencies of the two phenotypes and 
highlights that these differences arise from the 
rapid increase in interfacial area caused by 
DBs. In both cases, the shape of the interface 
defines the dynamics of volume decrease. For 
SBs, the interfacial area defined by the spherical 
droplet determines the number of cells 
(N) that can pack onto the interface to have 
access to the oil. Conversely, for DBs, it is the 
increase of N at the interface due to cell divi- 
sion that determines the interfacial area. Thus, 
the rate of consumption decreases over time 
for SBs, whereas it increases continuously 
for DBs. 
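
The contrast between the two regimes can be sketched numerically. The snippet below is a minimal illustration of interface-limited consumption, not the fitted models of supplementary text S3. It assumes that SB cells tile a shrinking sphere at fixed area per cell, that every DB interfacial cell stays on the interface and divides at a fixed interval, and that each cell consumes oil at a constant rate. The function names, the area per cell, and the initial cell number are hypothetical.

```python
import math

def sb_volume(t_h, v0_fl, q_fl_per_h, cell_area_um2):
    """Oil volume (fl) under a spherical biofilm after t_h hours.

    Assumed: interfacial cells tile the sphere (N = S / cell_area) and each
    consumes q_fl_per_h, so dV/dt = -q*N and the radius shrinks linearly.
    """
    r0 = (3.0 * v0_fl / (4.0 * math.pi)) ** (1.0 / 3.0)  # um, since 1 fl = 1 um^3
    r = max(r0 - (q_fl_per_h / cell_area_um2) * t_h, 0.0)
    return 4.0 / 3.0 * math.pi * r ** 3

def db_volume(t_h, v0_fl, q_fl_per_h, n0_cells, t_div_h):
    """Oil volume (fl) under a dendritic biofilm after t_h hours.

    Assumed: all interfacial cells remain on the interface and divide every
    t_div_h, so N(t) = n0 * 2**(t / t_div) and dV/dt = -q*N(t).
    """
    consumed = (q_fl_per_h * n0_cells * t_div_h / math.log(2)
                * (2.0 ** (t_h / t_div_h) - 1.0))
    return max(v0_fl - consumed, 0.0)

# Hypothetical droplet of radius 15 um; per-cell rates and division time
# are the values quoted in the text, the remaining numbers are placeholders.
V0 = 4.0 / 3.0 * math.pi * 15.0 ** 3
sb_first_hour = sb_volume(0.0, V0, 0.7, 2.0) - sb_volume(1.0, V0, 0.7, 2.0)
sb_later_hour = sb_volume(10.0, V0, 0.7, 2.0) - sb_volume(11.0, V0, 0.7, 2.0)
db_first_hour = db_volume(0.0, V0, 0.8, 1000, 1.65) - db_volume(1.0, V0, 0.8, 1000, 1.65)
db_later_hour = db_volume(2.0, V0, 0.8, 1000, 1.65) - db_volume(3.0, V0, 0.8, 1000, 1.65)
```

In this sketch the SB volume is a cubic polynomial in time and its hourly consumption falls as the droplet shrinks, whereas the DB hourly consumption grows with the interfacial population. Because it ignores cell loss from the interface, the DB branch consumes oil faster than the measured ~20 hours and should be read qualitatively only.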


Tubulation is facilitated by topological defects 


We correlated the onset of the rapid increase 
in surface area for DBs to the emergence of ne- 
matic order of the interfacial cells. Active sys- 
tems with nematic order have shown the 
ability to use topological defects to achieve 
surface deformation, both theoretically (28, 29) 
and experimentally (supplementary text S4) 
(13, 14, 30). Nematic topological defects of 
charge +1/2 and +1 are shown schematically 
in fig. S4A. At 2 to 4 hours after confluency, we 
observed the appearance of conical protrusions 
of cells that originate from the core of aster (+1) 
topological defects in the nematic director field, 
which is the average orientation of the bacteria 
(Fig. 2, A and B; fig. S4B; and supplementary 
text S3). As the biofilm matures, more protru- 
sions appear, while existing protrusions elongate 
into branched bacteria-covered tubes (Figs. 1C, 
16 hours, and 2C). 

Differential labeling of the oil and cells shows 
that the tubes are not filled with water; instead, 
they are filled with oil (fig. S5A). This indicates 
that cell adhesion to the oil stabilizes the tubes 
against collapse, preventing the deformed oil 
from regaining a spherical shape. Furthermore, 
careful inspection of the confocal images of the 
tubes reveals that the bacteria are well aligned 



Fig. 2. Dendrites originate from topological defects. (A and B) Confocal images of representative 
droplets (A) early (~0 hours) and (B) later (~12 hours) in biofilm development. The images are color coded 
by depth. The dashed circles enclose +1 topological defects, and the arrows indicate protrusions. [(B), inset] 
Magnified view of the central defect circled in yellow. (C) Confocal image of a bacteria-covered tube 
connected to a deformed droplet with corresponding orthogonal views (~20 hours). (D) Director field of 
the visible cells in the dashed box in (C). The director field (yellow lines), tube axis (red line), the local 
tangent unit vector (n_tube) along the tube axis, and the bin window (blue) are shown. (E) Axial order of the 
cells along the oil tube shown in (D). For a sliding window of width 1.5 μm along the tube, the local axial order 
is defined as the average of the individual scalar products between the director-field unit vectors (n_cell) 
and local tangent unit vectors. ⟨n_cell · n_tube⟩ = 1 for parallel alignment and 0 for perpendicular alignment. Solid 
line and shaded region are mean ± SD. (F) Oil tube length (l_tube) plotted as a function of time (n = 18 tubes; 
three independent tests). The dashed line is a fit to the average tube length using an exponential 
equation [adjusted coefficient of determination (R²) = 0.96]. Scale bars, 10 μm. 

to the tube axis, which becomes clear in the 
director field of the cells on the tube (Fig. 2D). 
We characterized cell alignment along the tube 
with the local average scalar product between 
the director field and the tube axis, finding 
values close to 1; this indicates a high degree of 
alignment between the cells and the tube axis 
(Fig. 2E and fig. S5, B and C). 
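
The alignment measure described above can be sketched as follows. This is a simplified illustration rather than the analysis pipeline used in the study. It assumes a straight tube with a single tangent angle, in-plane orientations given as angles, and the absolute value of the scalar product (because a director has no head or tail), so that the result runs from 0 for perpendicular cells to 1 for parallel cells. The function names and the windowing convention are hypothetical.

```python
import math

def axial_order(cell_angles_rad, tube_angle_rad):
    """Mean |n_cell . n_tube| for unit vectors described by in-plane angles."""
    dots = [abs(math.cos(a - tube_angle_rad)) for a in cell_angles_rad]
    return sum(dots) / len(dots)

def sliding_axial_order(positions_um, cell_angles_rad, tube_angle_rad, width_um=1.5):
    """Axial order in a window of width_um centered on each cell position."""
    half = width_um / 2.0
    profile = []
    for center in positions_um:
        # Every window contains at least the cell it is centered on.
        window = [a for p, a in zip(positions_um, cell_angles_rad)
                  if abs(p - center) <= half]
        profile.append(axial_order(window, tube_angle_rad))
    return profile

# Hypothetical cells: two aligned with the tube, one perpendicular to it.
profile = sliding_axial_order([0.0, 0.5, 3.0], [0.0, 0.0, math.pi / 2], 0.0)
```

For a curved tube, the single tangent angle would be replaced by the local tangent at each window position.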

Because of this alignment, we hypothesized 
that the rate of increase of the tube length is 
proportional to the number of cells on the tube, 
which should increase exponentially if all cells 
were to divide. We measured tube elongation 
on different droplets and found that it increases 
rapidly and in a manner consistent with ex- 
ponential elongation, which supports this hypo- 
thesis (Fig. 2F and fig. S5D). Furthermore, from 
the fit to our data, we extracted a tube length- 
doubling time of ~3.4 hours, which is twofold 
greater than the cell division time (t_div) of 
1.65 hours (fig. S1D). This difference likely 
arises from the imperfect alignment of cells 
along the tubes and the expulsion of cells from 
the interface, which is visible around the tubes 
(Fig. 2C). 
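
A doubling time of this kind can be extracted from length measurements with a log-linear least-squares fit, sketched below. The data are synthetic and noise-free, generated only to illustrate the calculation. They are not measurements from the study, and the function name is hypothetical.

```python
import math

def doubling_time(times_h, lengths_um):
    """Fit ln(L) = ln(L0) + k*t by least squares and return ln(2)/k in hours."""
    logs = [math.log(length) for length in lengths_um]
    n = len(times_h)
    t_mean = sum(times_h) / n
    y_mean = sum(logs) / n
    slope = (sum((t - t_mean) * (y - y_mean) for t, y in zip(times_h, logs))
             / sum((t - t_mean) ** 2 for t in times_h))
    return math.log(2) / slope

# Synthetic tube lengths that double every 3.4 hours (illustration only).
times = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
lengths = [2.0 * 2.0 ** (t / 3.4) for t in times]

tube_doubling_h = doubling_time(times, lengths)
ratio_to_division = tube_doubling_h / 1.65  # compare with the cell division time
```

With the values quoted in the text, the ratio of tube doubling time to cell division time is 3.4/1.65, or roughly 2.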


Biofilm phenotypes are associated with a 
decrease of interfacial tension 


The large difference in biofilm morphology be- 
tween the two phenotypes suggests differences 
in their interfacial properties. A. borkumensis 
secretes amphiphilic molecules that are thought 
to aid in the assimilation of oil (10, 31-33). Furthermore, 
autolysis is thought to be important 
during A. borkumensis biofilm formation (11), 
which could liberate membrane-bound biosurfactants 
into the oil. These molecules can 
lower the oil-water interfacial tension (γ), making 
interfacial deformation easier. To measure 
changes in interfacial properties of the phenotypes, 
we fractionated SB and DB liquid cultures 
into three components (cells, conditioned media, 
and conditioned C16) to independently 
measure γ for each (fig. S6A). We found that 
γ for each of the respective fractions was depressed 
relative to control values; however, 
γ of the DB-conditioned oil decreased the most. It 
fell from 32 to 
7 mN/m and was about half the value for SBs 
(Fig. 3A; fig. S6, B to D; and table S1). Unexpectedly, 
when we microfluidically tested SB and 
DB cells using the DB-conditioned C16 instead 
of fresh C16, we found no change in observed 
phenotype (fig. S6E). Thus, despite the ~80% 
lower γ of DB-conditioned oil, the SB cells were 
unable to deform the oil-water interface, indicating 
that a lower γ is not sufficient to produce 
the DB phenotype. 

Interfacial behavior is also affected by cell 
hydrophobicity, which together with γ controls 
the extent to which oil wets the cells (fig. S7A). Cell 
hydrophobicity is thought to increase the longer 
that cells consume oil, potentially through accumulation 
of membrane-bound hydrophobic 
biosurfactants (33); however, the relationship 
with phenotype is unclear (19, 34). Although 
the bacteria appear to lie flat on the surface at 
t₀, the contact angle is difficult to estimate from 
confocal images (fig. S7B). Thus, to estimate cell 
hydrophobicity of both phenotypes, we measured 
the three-phase contact angle (θ) of 
a water drop deposited on a bacterial lawn submerged 
in oil (fig. S7, C to E) (35); larger θ values 
indicate greater hydrophobicity. SB cells, which 
were isolated after 1 day of culture, had θ = 
80°, whereas DB cells, which were isolated after 
5 days, had θ = 100° (Fig. 3B). This suggests 
that the midplanes of SB and DB cells were 
approximately 10% above and below the interface, 
respectively (fig. S7A). 
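
A rough geometric reading of these angles is given below. It assumes that a cell can be idealized as a cylinder of radius r lying flat on the interface, that the contact angle is measured through the water phase, and that the offset is expressed relative to the cell diameter. These are illustrative assumptions, and the symbol h is introduced here.

```latex
% h : position of the cell midplane relative to the oil-water interface
%     (assumed geometry: cylinder of radius r lying flat on the interface)
h = r\cos\theta, \qquad \frac{h}{2r} = \tfrac{1}{2}\cos\theta
% theta =  80 deg : h/(2r) \approx +0.09  (midplane on the water side)
% theta = 100 deg : h/(2r) \approx -0.09  (midplane on the oil side)
```

Under these assumptions the midplane sits a little under one-tenth of a cell diameter from the interface, on opposite sides for the two phenotypes, which is of the same order as the offset quoted above.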

The higher hydrophobicity of DB cells also 
indicates that they have a larger interfacial adhesion 
strength than that of SB cells, for a constant 
γ (supplementary text S4). To directly 
compare the respective adhesion strengths 
of SB and DB cells, we forced them to compete for 
interfacial area on oil microdroplets, noting 
that both phenotypes have similar division 
times (fig. S1D). We generated cell-laden droplets 
using mCherry-expressing SB cells and 
green fluorescent protein (GFP)-expressing DB 
cells in a 3:1 ratio and recorded fluorescence 
intensity as the biofilm developed. In this test, 
both phenotypes experienced the same γ. We 
found that although the DB cells were initially 
in the minority, they dislodged the established 
SB cells, becoming dominant in ~5 hours 
(Fig. 3, C and D). This confirmed our expectation 
that DB cells, which are more hydrophobic, 
do indeed have a larger adhesion energy 
to the interface than that of SB cells. 

Rod-shaped gammaproteobacteria such as 
Pseudomonas aeruginosa have been shown to 
colonize and remodel oil-water interfaces with 
their biofilms, similar to what we observed in the 
early stages (<3 hours) of DB formation (36, 37). 
However, those biofilms lack the large-scale de- 
formations produced by DBs (Fig. 1C, >5 hours). 
Using our γ, we estimated the biofilm compression 
modulus to be ~200 Pa (38), which is much 
smaller than the ~1 MPa growth pressure of 
P. aeruginosa (39). Assuming a similar growth 
pressure for A. borkumensis, cell division sup- 
plies enough stress to easily deform the inter- 
face. Thus, DBs generate the biofilm phenotypes 
we observed only if enough cells remain adhered 
to the interface to drive the tubulation process. 
Conversely, the lack of deformations in the SB 
phenotype is the consequence of insufficient 
cell hydrophobicity combined with a stiffer 
interface, leading to cell detachment followed 
by biofilm formation around the droplet. 


Membrane theory predicts the transition from 
SB to DB phenotype 


On the basis of these observations, we devel- 
oped a coarse-grain membrane model to de- 
scribe the interfacial dynamics of the growing 

Fig. 3. Theoretical model of tubulation. (A) Interfacial tension (γ) between biofilm-conditioned oil and 
fresh media measured with pendant drop tensiometry for SBs and DBs, respectively. The dashed line 
indicates γ between fresh oil and fresh medium [mean ± SD; n_SB = 3, n_DB = 10, n_control = 5 independent tests; 
*P < 0.05 (P = 0.012), Welch's t test with Holm-Bonferroni correction]. (B) Three-phase contact angle 
(θ) between a water drop deposited on a bacterial lawn of SB and DB phenotypes, submerged in oil. Open 
squares indicate mean; horizontal dashed lines indicate medians; and the solid circle indicates an outlier 
[n_SB = 10, n_DB = 13 independent tests; *****P < 0.00001 (P = 1.2 x 10°), Welch's t test]. (Inset) Images of 
water drops on bacterial lawns, submerged in oil. (C) Time-lapse sequence of the competition between SB 
cells (magenta) and DB cells (green) for interfacial area on a trapped droplet. Droplets are generated in a 
suspension containing SB and DB cells at a ratio of 3:1 (supplementary text). DB cells displace a monolayer of 
SB cells over the course of ~6 hours. (D) Measurement of the fractional coverage of SB and DB as a function 
of time from confocal images (solid line and filled regions indicate mean ± SD; n = 10 representative drops). 
(E) Oscillatory biofilm (OB) behavior demonstrated by a culture sampled for an intermediate duration 
between the SB and DB cultures. (F) Model of the tension (κ_t in the phase-field model) experienced by the tip 
of a tube as a function of normalized interfacial cell density (ρ/ρ_H) (supplementary text). We normalized cell 
density by the homeostatic interfacial cell density (ρ_H). Positive and negative values of κ_t indicate tube 
retraction and expansion, respectively. The slope at the y intercept is k_1 = 3, 6, and 10 for the SB, OB, and DB 
models, respectively. ρ_B is the critical buckling density, where κ_t < 0. For SBs, ρ_B = ∞. ρ_S is the 
optimal cell density, where κ_t reaches a minimum, and beyond which the effect of spontaneous curvature is 
reduced. (Insets) Phase-field simulation showing a circular droplet (κ_t > 0) and a tubulated droplet (κ_t < 0). 
(G) Schematic phase diagram of the biofilm phenotypes in terms of normalized densities: ρ_S/ρ_H and ρ_B/ρ_H. 
When ρ_B/ρ_H > 1, SBs form because the interfacial cell density can never become sufficiently large to induce 
buckling. For ρ_B/ρ_H < 1, cell division drives an increase in cell density beyond ρ_B. When ρ_S/ρ_H < 1, 
oscillations between the spherical and dendritic phenotypes can occur (OB). When ρ_S/ρ_H > 1, stable 
dendrites (DB) occur. 

interfacial biofilm. The model explains the transition 
between SB and DB phenotypes in terms 
of a competition between the interfacial ten- 
sion and the spontaneous curvature of the bio- 
film, which is its intrinsic tendency to bend in 
a preferred direction. Interfacial tension resists 
expansion of the surface by the biofilm, whereas 
the spontaneous curvature of the biofilm governs 
the shape of the expanding surface. Tubes are 
generated when the energetic cost of increas- 
ing surface area is lower than the cost of bend- 
ing; they expand exponentially, according to 


=) 


where L is the tube length, η is the viscosity 
of the biofilm layer, σ_zz and σ_φφ are the respective 
tensions along the circumferential and 
normal directions of the tube, κ_φ is the bending 
rigidity of the membrane in the circumferential 
direction of the tube, 1/r_0 is the spontaneous 
curvature of the biofilm, and r_eq is the tube 
radius (supplementary text S6 and fig. S14). 

Equation 1 encompasses the existence of active 
extensile nematic stresses driven by bacterial 
growth, with an effective interfacial tension 
γ_eff(ρ) = ½(σ_zz + σ_φφ) that depends on the 
interfacial cell density (ρ) (supplementary text 
S6). This implies that tubulation occurs at a 
critical value of the buckling density (ρ_B), which 

Fig. 4. Defect-mediated buckling of the surface of a confined droplet. (A to C) Time-lapse confocal 
image sequence showing the evolution of a dendritic biofilm on a “flattened” drop, color coded by depth. 
(A) Before confluency (−40 min), interfacial tension excludes cells from the flattened regions at the top 
and bottom of the droplets. These regions are generated where the nonwetting droplets contact the 
glass floor and polydimethylsiloxane (PDMS) ceiling. (Top inset) A droplet schematic. (Bottom inset) 
Top view of the droplet. (B) Confluent monolayer is formed at t₀ = 0 min (fig. S10). (Inset) Top view of 
droplet, with a circle enclosing the location of the nematic defect. (C) Dimple formation in the droplet 
at the defect. Scale bars, 10 μm. (D) Evolution of the dimple height (h) from xy-line profiles measured 
across the droplet midplane, at 20 min (early) and 26 ± 3 min (late) after confluency. Dimple height 
is shown as a function of the radial coordinate (r). The fits to the data are based on eq. S80 
(supplementary text). The solid error bars and shaded regions indicate SD on the experimental data 
and fits, respectively (details on the fitting procedure and error estimation are available in the 
supplementary text). Early-profile data are the average of the x and y profiles (n = 4) at t = 20 min, 
whereas the late-profile data are the average of the x and y profiles binned at 23, 26, and 29 min 
(n = 12), respectively. To generate the dimensional values shown, we rescaled the 
nondimensionalized h values by the dimple heights 4.5 and 8.0 μm, at early and late 

is consistent with our observation that tubes 
form after confluency (Fig. 1C). In our model, 
bacteria populate a circular oil-water interface 
with a density that increases logistically 
toward a homeostatic density (ρ_H), at which 
division and loss are balanced. Depending on 
the ratio ρ_B/ρ_H, the phenotype changes: When 
ρ_B/ρ_H > 1, tubes are unable to form, resulting 
in the SB phenotype; when ρ_B/ρ_H < 1, the interface 
buckles and stable tubes form, producing 
the DB phenotype. Our model implies that 
physically lowering ρ below the critical value 
by removing cells from a DB should cause a deformed 
droplet to recover its unperturbed spherical 
shape. To test this hypothesis, we infused 
surfactants into a device containing DBs. We 
used a formulation similar to that of Corexit 
9500, which is used to disperse marine oil spills, 
and concentrations that ranged from 25 to 
100 times the critical micelle concentration (37). 
Our concentrations exceed estimated levels in 
the top 20 cm of the ocean after the Deepwater 
Horizon accident (4, 37). After a ~4-hour lag, the 
biofilm abruptly washed away, and the de- 
formed droplets became spherical; this recovery 
was the consequence of positive interfacial ten- 
sion in the absence of bacteria (fig. S8, A and B, 
and movie S3). 
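
The density argument in this paragraph can be illustrated with a minimal logistic sketch. It assumes the standard logistic form with a single growth rate, here set from the 1.65-hour division time as a placeholder. The function names and the numerical densities are hypothetical and are normalized by the homeostatic density.

```python
import math

def logistic_density(t_h, rho0, rho_h, rate_per_h):
    """Interfacial density at time t_h for logistic growth toward rho_h."""
    return rho_h / (1.0 + (rho_h / rho0 - 1.0) * math.exp(-rate_per_h * t_h))

def time_to_buckle(rho0, rho_h, rho_b, rate_per_h):
    """Time (hours) at which the density first reaches the buckling density.

    Returns None when rho_b >= rho_h: the density saturates below the
    threshold, which corresponds to the SB case described in the text.
    """
    if rho_b >= rho_h:
        return None
    if rho0 >= rho_b:
        return 0.0
    return math.log(rho_b * (rho_h - rho0) / (rho0 * (rho_h - rho_b))) / rate_per_h

# Placeholder growth rate derived from the 1.65-hour division time.
RATE = math.log(2) / 1.65
t_buckle = time_to_buckle(0.1, 1.0, 0.5, RATE)   # threshold below homeostatic density
never = time_to_buckle(0.1, 1.0, 1.2, RATE)      # threshold above homeostatic density
```

Removing cells corresponds to resetting the density to a value below the buckling threshold, after which the interface would be expected to relax until growth raises the density again.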

In addition to the SB and DB phenotypes 
described so far, we observed an intermediate 
phenotype when we isolated cells at an inter- 
times, respectively, whereas r was rescaled by 26 μm at all times. 

mediate culture time (fig. S1A). These biofilms 
present dynamic oscillatory behavior (OB), alter- 
nating between the DBs and SBs, with a period 
of ~12 hours (Fig. 3E, fig. S9, and movie S4). The 
relatively short (~30 min) transition from tubu- 
lated to spherical is consistent with a sudden 
loss of tube stability because of a sudden increase 
in effective surface tension (Eq. 1). 

To elucidate the emergence of oscillations 
between spherical and dendritic phenotypes, we 
used a phase-field approach following established 
literature on simulating the interfacial dynamics 
of multiphasic systems (40). In our case, 
the simulated field is the local fraction of oil 
[φ(x, y)], which obeys a Cahn-Hilliard equation 
with an interfacial term κ_t(ρ)(∇φ)²/2; here, κ_t(ρ) 
is defined by the right side of Eq. 1 and sets 
the tube elongation rate. κ_t(ρ) is thus a proxy for 
the active stress along the interface (supplementary 
text S4). In our model, a linear relation of 
the form κ_t(ρ) = γ − k_1ρ, where k_1 accounts for 
the force exerted by the biofilm on the oil, is 
sufficient to understand the SB-to-DB transition. 
For the SB phenotype, the κ_t(ρ) tension is 
positive for all densities, precluding tube formation. 
The interface remains spherical until ρ 
surpasses the critical density, ρ_B = γ/k_1; here, 
κ_t(ρ) becomes negative, and tubulation occurs 
as the interface buckles. However, the morphological 
oscillations emerge only if we consider a 
second-order expansion of κ_t(ρ) in ρ, so that an 
“optimal” cell density exists where tube elongation 
and the final tube length are maximal, 
which we denote ρ_S (Fig. 3F). Oscillations arise 
for the specific case when ρ_B < ρ_S < ρ_H (supplementary 
text S5); here, as cell density increases 
beyond the optimal value ρ_S, tube elongation 
ceases, and contraction starts. During this contraction, 
the surface shrinks faster than cells 
can be ejected from the interface, leading to an 
abrupt increase in bacterial density. We found 
that ρ overshoots ρ_H and causes κ_t(ρ) to become 
positive (Fig. 3F, blue). This dynamical 
overshoot ultimately induces a catastrophic 
collapse of the tubes and is accompanied by a 
large reduction in the number of cells at the 
interface. Consistent with this prediction, in 
our experiments we observed a large and persistent 
flow of cells away from the interface 
soon after tube collapse (movie S4). Our biomechanical 
model recapitulates the transition 
between these three phenotypes in terms of 
a tubulation mechanism that depends on bacterial 
growth dynamics, with oscillations emerging 
through a dynamical phase transition 
mechanism (Fig. 3G and movie S5) (41). These 
oscillations are driven by continuous cell division 
that pushes density beyond the critical 
values for tube growth and collapse. 
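
The density thresholds discussed in this section can be collected into a small decision rule. The sketch below encodes one reading of the text and of the Fig. 3G caption: spherical biofilms when the critical buckling density is not reachable below the homeostatic density, oscillations when the optimal density also lies below the homeostatic density, and stable dendrites otherwise. The linear tension is the first-order form described above. All function and parameter names are hypothetical, and the phase-field simulation itself is not reproduced.

```python
def linear_tension(rho, gamma, k1):
    """First-order effective tension: positive resists tubes, negative favors them."""
    return gamma - k1 * rho

def critical_buckling_density(gamma, k1):
    """Density at which the linear tension changes sign."""
    return gamma / k1

def classify_phenotype(rho_buckling, rho_optimal, rho_homeostatic):
    """Assign SB, OB, or DB from the ordering of the three densities."""
    if rho_buckling >= rho_homeostatic:
        return "SB"   # density saturates before buckling can occur
    if rho_optimal < rho_homeostatic:
        return "OB"   # growth pushes density past the optimum, so tubes collapse
    return "DB"       # optimum not exceeded, so tubes remain stable

# Placeholder values in units of the homeostatic density.
examples = [
    classify_phenotype(1.5, 2.0, 1.0),
    classify_phenotype(0.3, 0.6, 1.0),
    classify_phenotype(0.3, 1.4, 1.0),
]
```

The rule only reproduces the ordering of thresholds. The oscillation period and the overshoot dynamics depend on the second-order form of the tension and on the growth kinetics, which this sketch leaves out.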


Controlled buckling of confined droplets 


We leveraged microfluidics to position the tube- 
generating topological defects through droplet 
confinement. By trapping droplets larger than 
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the chamber height, we can grow biofilms on 
flattened drops (Fig. 4A, top inset). These flat 
circular regions allowed us to generate a cen- 
tered defect, which concentrates growth stress 
to form a dimple. Interfacial tension, which ini- 
tially excludes bacteria from the flat region, 
was eventually overcome by growth pressure 
(Fig. 4A, fig. S10, A and B, and movie S6). As cells
invaded, their flow oriented the director field, 
which formed a single aster defect (or two nar- 
rowly separated +1/2 defects) at confluency 
(Fig. 4B and fig. S10, C and D) (15). As cell divi- 
sion continued, dimples formed at the defects 
at the top and bottom of the drop (Fig. 4C and 
fig. S10, B, inset, and E). We used the theory of 
liquid crystal membranes to elucidate the role 
of the aster defect in generating deformation in 
the biofilm. The dimple height profile depends
on two dimensionless parameters: $\kappa_F/\kappa_B$, where
$\kappa_F$ is the elastic constant of distortions to the
director field, and $\kappa_B/(\gamma R_{\mathrm{dimple}}^2)$, where $R_{\mathrm{dimple}}$ is
the radius of the dimple (supplementary text
S8). $\kappa_F/\kappa_B$ captures the competition between
the elastic energy of distortions to the liquid
crystalline biofilm and its bending energy,
whereas $\kappa_B/(\gamma R_{\mathrm{dimple}}^2)$ captures the competition
between biofilm bending energy and its surface
tension. Buckling at the defect occurs when
the energetic cost to deform the membrane
exceeds its bending energy, $\kappa_F/\kappa_B > 1$; our fits
yield a ratio of ~2 (fig. S11 and table S2) (29, 42).
In agreement with experiments, the profiles in 
this parameter regime are qualitatively like 
pseudospheres (Fig. 4D and fig. S12) (42). 

The ability of marine microorganisms to bio- 
degrade hydrocarbons has been recognized for 
nearly a century (1), with recent improvements
to metagenomic and imaging techniques further 
clarifying the possible mechanisms involved. In 
this study, we quantified the mechanism by
which preadapting cultures of A. borkumensis
to an alkane leads to the appearance of
a biofilm phenotype primed (43) for explosive
growth. Leveraging microfluidics, we captured 
the dynamics of the optimized biofilms, correlat- 
ing their appearance to measurable changes in 
the interfacial properties of the bacteria. These 
biofilms develop liquid crystalline order before 
buckling the interface at aster topological de- 
fects, morphing the spherical droplet into num- 
erous branching dendrites. We developed a 
theoretical model that recapitulates each phe- 
notype and explains the cause of the spectacular 
biofilm oscillations we observed. Furthermore, 
we used microfluidics to control the dimpling 
of the droplet and used the profiles to confirm 
the liquid crystalline elastic constant of the bio- 
film from theory. 

Topological defects in layers of both prokar- 
yotic and eukaryotic cells have been shown to 
be important in defect-driven morphogen- 
esis (13, 14, 29, 30, 44). We found that the 
optimized A. borkumensis cells collectively 
use defect-driven buckling to escape the confines
of the droplet interface and expand the
biofilm laterally. Efficiency is achieved not 
through an increase of individual metabolic 
throughput but rather by expanding the inter- 
face, allowing more cells to simultaneously feed. 

Although we found that addition of oil dis- 
persant similar in composition to commercially 
available mixtures leads to the rapid detachment 
of the biofilm from the oil drops, numerous fac- 
tors such as the dispersant concentration; oil 
composition, temperature, and pressure; and 
nutrient concentrations likely affect biodegradation.
Because our platform is an open system,
all inputs can be dynamically controlled;
thus, how biofilm formation depends
on hydrocarbon composition and nutrient profile,
and how bacteria respond dynamically
to dispersant dose, may be independently verified.
These tests could also serve as a starting
point in the investigation of artificially con- 
structed multispecies consortia (8), which are 
more robust and effective than monocultures (9). 


A massive helium star with a sufficiently strong 
magnetic field to form a magnetar 
Helge Todt, Silvia Toonen


Magnetars are highly magnetized neutron stars, the formation mechanism of which is unknown. Hot 
helium-rich stars with spectra dominated by emission lines are known as Wolf-Rayet stars. We 
observed the binary system HD 45166 using spectropolarimetry and reanalyzed its orbit using 
archival data. We found that the system contains a Wolf-Rayet star with a mass of 2 solar masses 
and a magnetic field of 43 kilogauss. Stellar evolution calculations indicate that this component 
will explode as a supernova, and that its magnetic field is strong enough for the supernova to leave a 
magnetar remnant. We propose that the magnetized Wolf-Rayet star formed by the merger of two lower-mass
helium stars.


Neutron stars form in supernovae by the
collapse of stellar cores that exceed the
Chandrasekhar mass limit [mass M > 1.4
solar masses (M☉)]. Bare stellar cores
can be exposed as hot, evolved helium-rich
stars that have shed their outer hydrogen-rich
layers. A subset of these massive helium
stars is observed as Wolf-Rayet stars, which
have spectra dominated by broad emission
lines produced by strong stellar winds (1, 2).
Massive helium stars (those with M ≳ 1.4 M☉)
are thought to be stripped products of more 
massive stars that lost their hydrogen-rich en- 
velopes through stellar winds, eruptions, or 
interactions with a binary companion (3, 4). Alter- 
natively, massive helium stars could be produced 
through the merger of lower-mass objects (5). 
Approximately 10% of young neutron stars 
have magnetic fields >10¹⁴ G (6). These are
known as magnetars; their origin is debated 
(7, 8). One formation scenario invokes fossil 
magnetic fields rooted in the massive core of 
a progenitor star that collapses during a super- 
nova (9). About 7 to 10% of massive main-
sequence stars have strong (several kG) large-
scale surface magnetic fields (10, 11); such 
stars could be progenitors of magnetars. How- 
ever, corresponding magnetic fields have not 
been detected in evolved massive stars (12). 
Strongly magnetic low-mass helium stars have 
been observed (13-15), but not massive mag- 
netic helium stars exceeding the Chandrasekhar 
limit. 

The HD 45166 binary system (also cataloged 
as ALS 8946) comprises a main sequence star 
(classified as spectral type B7 V) with a hot 
stellar companion. The hot companion has a 
spectrum dominated by the characteristic
emission lines of Wolf-Rayet stars,
but it is classified as a “quasi-Wolf-Rayet”
(qWR) star owing to its peculiarly narrow emission
lines (widths of hundreds of kilometers
per second instead of the typical thousands); 
spectral variability; and the anomalous pres- 
ence of strong carbon, oxygen, and nitrogen 
lines in its spectrum. Previous radial velocity 
(RV) measurements have shown a 1.6-day pe- 
riodicity in the velocity of the B7 V compo- 
nent, which was interpreted as the orbital 
period of the system (16). This implied a mass
of 4.2 ± 0.7 M☉ for the Wolf-Rayet component
and a pole-on orbital configuration (inclination
i = 0.7°) (16). That mass is well below the
typical masses of Wolf-Rayet stars in the Milky
Way [M ≳ 8 M☉ (17)]; no other massive helium
stars are known with M < 8 M☉ (18). Comparison
with stellar atmosphere models that do
not assume local thermodynamic equilibrium 
(non-LTE models) has shown that the Wolf- 
Rayet component is a hot (surface effective 
temperature T* = 70 kK), helium-rich star
with enhanced nitrogen and carbon contents
(compared with the composition of the Sun)
(19). A latitude-dependent wind model can reproduce
the spectrum of the Wolf-Rayet component
(19); however, the implied mass-loss
rate is orders of magnitude greater than pre- 
dictions for helium stars of this mass (20-22). 

The presence of a carbon and nitrogen
emission-line complex in the range of 4430 to
4460 Å (known as the Of?p phenomenon)
and strong spectral variability are typical
indirect signatures of magnetism in hot
stars (23, 24). We used spectropolarimetry to
investigate whether the Wolf-Rayet component
in HD 45166 is magnetic.

Fig. 1. Indications of possible magnetism in HD 45166. Solid blue and thin dashed black lines indicate two
optical HERMES spectra of HD 45166 (25) taken ≈70 days apart (see legend). The thick black line indicates the
continuum level. Labels indicate the Of?p phenomenon (23, 24) and the He II line at 4686 Å, which is variable
(also seen in Fig. 3). These are common features of hot magnetic stars, which indicates that the Wolf-Rayet
component might be magnetic. The narrow Mg II ion line at 4481 Å is due to the B7 V component.

Fig. 2. Spectropolarimetry of HD 45166. (A) to (I) show the intensity
spectrum (Stokes I, black lines), diagnostic null spectrum (N, blue lines), and Stokes
V spectrum (red lines) from the co-added ESPaDOnS spectrum. The N and V spectra
have been vertically shifted and scaled for display purposes, by the factors listed in


Observations of HD 45166 


We collected eight spectropolarimetric obser- 
vations of HD 45166 in February 2022 using 
the Echelle Spectropolarimetric Device for the 
Observation of Stars (ESPaDOnS) spectro- 
polarimeter on the Canada-France-Hawaii 
Telescope (CFHT) (25). The spectra measure 
the polarimetric Stokes parameters I and V
over the wavelength range 3668 to 10,480 Å at
a resolving power of 65,000. We used them to 
measure the magnetic-field strength in the 
Wolf-Rayet and B7 V components of HD 45166. 

We used archival spectra from three other 
instruments to measure RVs over a longer 
period (25). They were 103 spectra obtained 
with the Coudé spectrograph at the 1.6 m tel- 
escope of Laboratorio Nacional de Astrofisica 
(LNA), mainly covering 4520 to 4960 Å; 36
spectra obtained with the Fiber-fed Extended
Range Optical Spectrograph (FEROS) at the
1.52 m telescope of the European Southern
Observatory (ESO), covering 3830 to 9215 Å; and
28 spectra acquired with the High-Efficiency
and High-Resolution Mercator Echelle Spectrograph
(HERMES) on the 1.2 m Mercator
telescope, covering 3770 to 9000 Å. We used
additional archival ultraviolet spectroscopy 
and photometric data to analyze the spectral 
energy distribution (SED) of the system (25). 


Evidence for magnetism 


We detected circular polarization (Stokes V) in 
the ESPaDOnS spectra for the majority of lines 
associated with the Wolf-Rayet component 
(Fig. 2). We also detected Zeeman splitting
(doubling of spectral lines owing to a magnetic
field) in two O V (quadruply ionized oxygen)
lines (Fig. 2, C and D) that form in or close to
the stellar surface. We measured the magnetic
field from the separation of the split Zeeman
components of these lines, finding a mean field
modulus $\langle B \rangle_{\mathrm{qWR}}$ = 43.0 ± 2.5 kG and a mean
longitudinal field $\langle B_z \rangle_{\mathrm{qWR}}$ = 13.5 ± 2.5 kG (table
S1). The ratio between these values (approximately
3:1) is consistent with a dipolar magnetic
field viewed near the magnetic pole (26).

Fig. 2 caption (fragment): of O V. We used these lines to measure the magnetic field strength. (I) An O I (neutral

Such a strong magnetic field in the Wolf- 
Rayet component implies that its emission-line 
spectrum is formed in the star’s magnetosphere 
(the region containing plasma confined by the 
magnetic field loops), not in a radially expand- 
ing stellar wind (25). If so, one-dimensional 
(1D) stellar atmosphere models [as used in 
previous studies (19)] are not appropriate for 
the emission lines, thus affecting the previ- 
ously derived physical parameters. 


Stellar parameters 


We reanalyzed the spectra and SED using 
PoWR, a non-LTE model atmosphere code op- 
timized for Wolf-Rayet stars (27, 28). Because 
the PoWR code is also 1D, we only used it to 
analyze spectral features that originate in the
stellar surface of the Wolf-Rayet component, 
not emission lines from the magnetosphere 
(25). The resulting physical parameters are 
listed in table S1. We found an effective tem- 
perature for the Wolf-Rayet component of 
T* = 56.0 ± 5.0 kK, which is ~15 kK lower
than previously determined (19). The effective
temperature and bolometric luminosity L [we
found log(L/L☉) = 3.830 ± 0.050, where L☉ is
the luminosity of the Sun] indicate that the 
magnetic Wolf-Rayet component is not a main 
sequence star, but an evolved object. 

We applied the same analysis to the B7 V component
(25), then input the resulting stellar
parameters into BONNSAI, a Bayesian tool for
comparing observations to stellar models (29).
That analysis indicates that the mass and age
of the B7 V component are $M_{\mathrm{B}}$ = 3.38 ± 0.10 M☉
and 105 ± 35 million years (Myr), respectively
(table S1). 

The Wolf-Rayet component exhibits changes
in line strength, including in the He II (doubly
ionized helium) line at 4686 Å. These appear
to be periodic (Fig. 3A), with a best-fitting period
of 124.8 days (Fig. 3B). Periodic changes
in the line strengths of hot magnetic stars
are typically interpreted as tracing their rotational
periods (30, 31), implying a rotational period
$P_{\mathrm{rot,qWR}}$ = 124.8 ± 0.2 days for the Wolf-Rayet
component. This period is consistent with the
narrow O V lines, which have a projected rotational
velocity v sin i < 10 km s⁻¹.


Binary orbit 


The ESPaDOnS spectra indicate that the B7 V 
component exhibits spectral line profile varia- 
tions caused by nonradial gravity-mode pulsa- 
tions (fig. S9). We performed a frequency 
analysis of an optical light curve obtained 
with the Transiting Exoplanet Survey Satellite 
(TESS), which detects several periods, includ- 
ing 1.6 days (25). We therefore concluded that 
the 1.6-day period previously attributed to 
orbital motion (16) is instead caused by pulsations
in the B7 V component. We reassessed
the binary orbit and the mass of the Wolf-Rayet
component $M_{\mathrm{qWR}}$ under this interpretation.
To determine the orbit, we analyzed all the 
RV data, which span 24 years (25). We found 
a long-term antiphase motion of the B7 V and 
Wolf-Rayet components (Fig. 4). There are 
multiple RV periodicities associated with 
both the Wolf-Rayet and B7 V components 
(25). We adopted the best-fitting orbit (table
S1), which has an orbital period P = 8200 ±
190 days and a semimajor axis a = 10.5 ± 1.8
astronomical units (au). This indicates that
the components are much more widely separated
than had previously been determined.
The orbital solution indicates a mass ratio
of q = $M_{\mathrm{qWR}}/M_{\mathrm{B}}$ = 0.60 ± 0.13. Combining this
with the mass of the B7 V component ($M_{\mathrm{B}}$ =
3.38 ± 0.10 M☉) derived from the BONNSAI
analysis implies $M_{\mathrm{qWR}}$ = 2.03 ± 0.44 M☉ (uncertainties
are 68% confidence intervals). This is
less than the 4.2 M☉ previously reported, but
still above the Chandrasekhar limit.
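
The mass of the Wolf-Rayet component quoted above is the product of the mass ratio and the companion mass. The arithmetic can be sketched as follows, assuming (this is not stated in the text) that the two uncertainties are independent and combine in quadrature; the function name is illustrative only.

```python
import math

def product_with_uncertainty(a, sigma_a, b, sigma_b):
    """Return a*b and its 1-sigma uncertainty, assuming independent errors."""
    value = a * b
    # Relative uncertainties of a product add in quadrature (assumption).
    relative = math.hypot(sigma_a / a, sigma_b / b)
    return value, value * relative

# Values quoted in the text: q = 0.60 +/- 0.13, M_B = 3.38 +/- 0.10 solar masses.
m_qwr, sigma_m_qwr = product_with_uncertainty(0.60, 0.13, 3.38, 0.10)
print(f"M_qWR = {m_qwr:.2f} +/- {sigma_m_qwr:.2f} Msun")
```

Under this assumption the uncertainty on the mass ratio dominates the error budget, because its relative error is several times larger than that of the companion mass.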

The derived mass and luminosity of the
qWR component are consistent with mass-
luminosity relations for helium stars (32). The
large separation and implied orbital inclination
of i = 49 ± 11° [derived from $M_{\mathrm{B}} \sin^3 i$ and
$M_{\mathrm{B}}$ (table S1)] explain the low-amplitude RV
motion of both binary components, without 
invoking a pole-on configuration. 


Implications for magnetar formation 


Fig. 4. Two-component orbital solution for HD 45166. RVs of the Wolf-Rayet and B7 V components are

start from the zero-age main sequence (ZAMS). The outer tertiary star (gray line) has an initial mass of

yellow) and after the main sequence (case AB, green). The upper tracks indicate the evolution of the merger

With a mass of 2.03 ± 0.44 M☉, we expect the
Wolf-Rayet component to evolve until it collapses
into a neutron star. We calculated evolutionary
models of the system, which are
consistent with this interpretation, although
the final fate of the Wolf-Rayet component is
sensitive to uncertainties in the model (25).
During core collapse, magnetic flux conservation
causes an increase in the magnetic
field at the surface. With a stellar radius of
$R_{*,\mathrm{qWR}}$ = 0.88 ± 0.16 R☉ (where R☉ is the radius
of the Sun), calculated with the Stefan-
Boltzmann relation, our measured $\langle B \rangle_{\mathrm{qWR}}$ =
43.0 ± 2.5 kG, and assuming a final neutron-
star radius of 12 km (33), we calculate the
expected magnetic field of the neutron star to
be $\langle B \rangle_{\mathrm{NS}}$ = (1.11 ± 0.42) × 10¹⁴ G. This is within
the range observed for magnetars [$\langle B \rangle$ ≳
10¹⁴ G (34)]. Our observations and stellar-
evolution models therefore indicate that the
Wolf-Rayet component could be an immediate
progenitor of a magnetar.
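
The two steps of this estimate (a radius from the Stefan-Boltzmann relation, then a field scaled by flux conservation) can be reproduced with a short Python sketch. It assumes nominal solar values (radius 6.957 × 10⁵ km, effective temperature 5772 K) that are not given in the text, and treats flux conservation as B R² = constant; the function names are illustrative.

```python
import math

R_SUN_KM = 6.957e5   # nominal solar radius in km (assumed, not from the text)
T_SUN_K = 5772.0     # nominal solar effective temperature in K (assumed)

def radius_from_stefan_boltzmann(log_l_over_lsun, t_eff_k):
    """Radius in solar radii from L = 4*pi*R^2*sigma*T^4, scaled to the Sun."""
    return math.sqrt(10.0 ** log_l_over_lsun) * (T_SUN_K / t_eff_k) ** 2

def flux_conserved_field(b_gauss, r_star_km, r_final_km):
    """Surface field after collapse if the magnetic flux B*R^2 is conserved."""
    return b_gauss * (r_star_km / r_final_km) ** 2

# Inputs quoted in the text: log(L/Lsun) = 3.830, T = 56.0 kK,
# <B> = 43.0 kG, R = 0.88 Rsun, neutron-star radius 12 km.
r_star = radius_from_stefan_boltzmann(3.830, 56.0e3)
b_ns = flux_conserved_field(43.0e3, 0.88 * R_SUN_KM, 12.0)
print(f"R = {r_star:.2f} Rsun, B_NS = {b_ns:.2e} G")
```

Because the field scales with the square of the radius ratio, the fractional uncertainty on the stellar radius enters twice, which is why the quoted error on the neutron-star field is much larger in relative terms than the error on the measured field.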

All magnetars in the Milky Way are isolated; 
they do not have a binary companion (6). For 
the Wolf-Rayet component in HD 45166, we 
expect the mass loss and velocity kick im- 
parted on the magnetar by the supernova ex- 
plosion to disrupt the system, given the large 
orbital separation. With an estimated rotation 
period of 125 d and an estimated radius of 
0.3 R☉ for the helium core of the Wolf-Rayet
component (35), angular-momentum conser- 
vation implies that the magnetar immediately 
after collapse would have a spin period <40 ms. 
This is similar to the spin period of the Crab 
Pulsar (33 ms), a neutron star that formed 
about 1000 years ago. Such a spin rate would 
not provide sufficient energy to power a super- 
luminous supernova or a long-duration gamma- 
ray burst (36, 37). 
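
The spin-period bound can be checked with a minimal Python sketch. It assumes that the collapsing core keeps its mass and angular momentum and that the moment of inertia scales as M R² with the same coefficient before and after collapse, so that the period scales as R²; these simplifications, the 12-km radius reused from the field estimate, and the nominal solar radius are assumptions rather than statements from the text.

```python
R_SUN_KM = 6.957e5        # nominal solar radius in km (assumed)
SECONDS_PER_DAY = 86400.0

def collapsed_spin_period(p_rot_s, r_core_km, r_ns_km):
    """Spin period after collapse for conserved angular momentum (P scales as R^2)."""
    return p_rot_s * (r_ns_km / r_core_km) ** 2

# Inputs quoted in the text: rotation period 125 d, helium-core radius 0.3 Rsun.
p_ns = collapsed_spin_period(125.0 * SECONDS_PER_DAY, 0.3 * R_SUN_KM, 12.0)
print(f"spin period after collapse: {p_ns * 1e3:.0f} ms")
```

With these inputs the period comes out in the tens of milliseconds, consistent with the <40 ms bound quoted above.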


Evolutionary model of the system 


We next considered how the Wolf-Rayet component
itself formed. We excluded the possibility
that it is the stripped core of a massive star,
because to produce a 2 M☉ stripped core, the
progenitor star would need to have had an
initial mass of ~10 M☉. Single-star evolution
models do not predict that stars of that mass
become stripped by mass loss (22, 38), and the
B7 V companion is too far away for binary
interactions to have done so. In addition, the
total lifetime (including post-main-sequence
evolution) of a 10 M☉ star would be ~30 Myr
(39), which is well below our derived age of
the B7 V component (105 ± 35 Myr), so we
reject the possibility that they could both be 
present in the same binary system. 

Stellar mergers have been proposed as a po- 
tential origin of magnetic stars (40-43). Strong 
magnetic fields have been identified in low- 
mass helium stars (classified as OB-type sub- 
dwarfs) that have been suggested to originate 
from merger events between two white dwarfs 
(13, 14). However, to produce a ≈2 M☉ helium
star through a merger of white dwarfs, the 
merger would need to have involved massive 
CO or ONe white dwarfs, which are rare. Mod- 
els predict that such a merger product would 
either immediately explode as a supernova 
(44) or do so after a short life span of approx- 
imately 10,000 years (45). We argue that ob- 
serving the stellar product of a white-dwarf 
merger in the brief time available is unlikely. 

We therefore propose that the Wolf-Rayet 
component in HD 45166 formed from the 
coalescence of the helium cores of two 
intermediate-mass stars that were bound in a 
close binary. We constructed an evolutionary 
model of this scenario (25) (Fig. 5) using the 
MESA stellar evolution code (46). We started
the system in a triple configuration, with a 
tight inner binary and a distant tertiary star 
(the B7 V component). The model indicates 
that the primary (more massive) star of the
inner binary expanded and interacted with its 
companion, losing its outer layers and becom- 
ing a stripped star. Meanwhile, the companion 
(secondary star) accreted material and became 
rejuvenated with hydrogen. The secondary star 
later expanded itself, leading to an unstable 
mass-transfer back to the primary. In the model, 
this process leads to the formation of a gaseous 
envelope around the two stars, during which 
they both lose orbital energy owing to friction 
with the envelope, causing them to spiral in- 
ward (known as common-envelope evolution). 
Because of the high binding energy of the 
hydrogen-rich layers, this phase ends with 
both helium cores merging into a ≈2 M☉ magnetized
helium star, while most (but not all)
of the hydrogen envelope is ejected. 

With a luminosity predicted to be 200 times 
that of a 2 M☉ main-sequence star (47), the
merger product has a high luminosity-to-mass 
ratio. This, combined with its high effective 
temperature, launches a radiatively driven out- 
flow from its surface. In the absence of a mag- 
netic field, such an outflow would not be easily 
detectable in a spectrum. However, the trap- 
ping of an outflow by a magnetic field is known 
to increase the density of the circumstellar 
material and produce spectral emission lines, 
similar to those we observed in the Wolf-Rayet 
component (26, 48). The distant tertiary star 
in our model does not affect the final outcome 
of the inner binary evolution, except perhaps 
by catalyzing the merging of its stellar com- 
ponents (49). Given its large orbital separa- 
tion, we do not expect the tertiary star to show 
any evidence for accreted material from the 
merger ejecta. 

Our proposed evolutionary scenario is quan- 
titatively and qualitatively consistent with the 
observed properties of the system. The MESA
model reproduces the masses of the two com- 
ponents and the estimated system age. The 
proposed merger provides an explanation for 
the emergence of a magnetic field in the Wolf- 
Rayet component, and the magnetic field 
provides an explanation for the presence of 
emission lines in the spectrum of a 2-M☉ helium
star. The binary nature of HD 45166 enabled
us to use the companion as a clock to
constrain the evolutionary path of the system, 
and as a scale to determine the mass of the 
Wolf-Rayet component, which are conditions 
that are rarely present in other proposed 
merger products (42, 50). Given the proximity 
of HD 45166 to Earth (≈1 kpc), other massive
magnetic helium stars have likely already been 
spectroscopically identified as Wolf-Rayet stars 
but not recognized as magnetic (supplemen- 
tary text). 
Qing Dai, John Pendry, Xiang Zhang, Shuang Zhang


Superlenses made of plasmonic materials and metamaterials can image features at the subdiffraction 
scale. However, intrinsic losses impose a serious restriction on imaging resolution, a problem that has 
hindered widespread applications of superlenses. Optical waves of complex frequency that exhibit a 
temporally attenuating behavior have been proposed to offset the intrinsic losses in superlenses through 
the introduction of virtual gain, but experimental realization has been lacking because of the difficulty of 
imaging measurements with temporal decay. In this work, we present a multifrequency approach to 
constructing synthetic excitation waves of complex frequency based on measurements at real 
frequencies. This approach allows us to implement virtual gain experimentally and observe 
deep-subwavelength images. Our work offers a practical solution to overcome the intrinsic losses
of plasmonic systems for imaging and sensing applications.


In conventional optical imaging, Abbe diffraction
limits the resolution of feature sizes
to larger than half the wavelength. This limit
is due to the loss of the subwavelength information
carried by evanescent waves. To overcome
this limitation, a negative refractive index
lens has been proposed to enhance the evanescent
waves to recover the deep-subwavelength
resolution of imaging (1, 2). Subsequently, superlenses,
made of either natural materials with
negative permittivities (3-7) or hyperbolic mate- 
rials with mixed signs of dielectric constants 
along different directions (8-13), have been 
proposed to attain subdiffractional limited 
imaging. Nevertheless, losses are non-negligible 
in materials with negative parameters (14-16),
which reduces the deep-subwavelength information
of the superlenses and seriously affects
the resolution of imaging (17-19). To compensate
for these losses, it has been proposed that
gain materials could be incorporated into meta- 
material designs or plasmonics (20-27), but the 
setup is extremely complicated and the gain 
will inevitably introduce instability and noise 
into the system (28-30). It has been proposed 
that complex-frequency waves (CFWs) with 
temporally growing or attenuating behaviors 
could provide virtual absorption or virtual gain
(31-35). Some theoretical proposals have been 
put forward to recover the deep-subwavelength 
information carried by surface plasmons through 
the excitation of CFWs with temporal attenuation 
(32-35). However, synthesizing CFWs is chal- 
lenging in optical systems from a practical 
perspective. 

To address this challenge, we synthesized 
CFW signals using a multifrequency approach. 
We exploited the fact that a truncated CFW 
can be expressed as a combination of multiple 
frequency components with coefficients that 
follow a Lorentzian spectral lineshape through 
the Fourier transformation. We performed mea- 
surements at multiple real frequencies and 
numerically synthesized the field distribution 
under CFW illumination by combining the mea- 
sured field plots at different frequencies accord- 
ing to the Lorentzian lineshape. As a proof of 
concept, both a SiC slab operating at optical 
frequencies and a bulk hyperbolic metamate- 
rial operating at microwave frequencies were 
used as superlenses. We show that, although 
the spatial resolution of imaging at real fre- 
quencies is poor, caused by the inevitable mate- 
rial loss in these systems, ultrahigh resolution 
imaging can be obtained with synthesized CFWs 
that consist of multiple frequency components. 
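
The combination of real-frequency data with Lorentzian weights described above can be illustrated with a minimal NumPy sketch. It assumes a complex frequency $\tilde{\omega} = \omega - i\tau/2$ and weights of the form $e^{i(\tilde{\omega} - \omega_n)t} / [i(\tilde{\omega} - \omega_n)] \cdot \Delta\omega/2\pi$ on a uniform grid of real frequencies $\omega_n$. The toy response function, the grid, and the values of $\tau$ and $t$ are hypothetical choices made only so that the result can be compared with a closed form; they are not the measured data or parameters of this work.

```python
import numpy as np

def synthesize_complex_frequency(samples, omegas, omega_tilde, t):
    """Estimate F(omega_tilde) from samples F(omega_n) on a uniform real grid."""
    omegas = np.asarray(omegas, dtype=float)
    d_omega = omegas[1] - omegas[0]                     # uniform spacing assumed
    lorentzian = 1.0 / (1j * (omega_tilde - omegas))    # Lorentzian weight
    phase = np.exp(1j * (omega_tilde - omegas) * t)     # evaluated at time t
    return np.sum(np.asarray(samples) * lorentzian * phase) * d_omega / (2 * np.pi)

def toy_response(omega, damping=6.0):
    """Hypothetical lossy resonance at omega = 0 (pole in the lower half-plane)."""
    return 1.0 / (-omega - 0.5j * damping)

tau = 1.0                                    # temporal attenuation (arbitrary units)
omega_tilde = 0.0 - 0.5j * tau               # complex frequency of interest
omegas = np.arange(-100.0, 100.0 + 0.025, 0.05)
t_obs = 6.0                                  # long versus 1/damping, short versus 2*pi/d_omega

flat = synthesize_complex_frequency(np.ones_like(omegas), omegas, omega_tilde, t_obs)
synth = synthesize_complex_frequency(toy_response(omegas), omegas, omega_tilde, t_obs)
exact = toy_response(omega_tilde)
print(abs(flat - 1.0), abs(synth - exact))
```

In this toy case the synthesized value approaches the analytic continuation of the response because the transient from the response's own pole decays faster than the synthetic wave. The finite frequency window and the factor $e^{\tau t/2}$ set the residual error, so the window width and $t$ have to be chosen together.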


Loss compensation with CFWs 


We start with an example of loss compensation
for a metallic material described by the
Drude model, $\varepsilon(\omega) = 1 - \omega_p^2/(\omega^2 + i\omega\gamma)$, where $\gamma$ is
the nonzero ohmic loss term. Below the plasma
frequency $\omega_p$, the permittivity becomes negative,
making it suitable as a plasmonic material,
or for constructing hyperbolic media, to
support surface or bulk waves with a very large
wave vector for subdiffraction imaging. Owing
to the existence of a loss term, the negative
permittivity is typically accompanied by an
appreciable imaginary part (left panel of Fig. 1A),
which seriously limits the imaging performance.
Interestingly, from a mathematical perspective,
by transforming the frequency into the suitable
complex value $\omega \to \tilde{\omega} = \omega - i\gamma/2$, the permittivity
is turned into a purely real value $\varepsilon(\tilde{\omega}) = 1 -
\omega_p^2/(\omega^2 + \gamma^2/4)$.
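
The cancellation follows from $(\omega - i\gamma/2)^2 + i(\omega - i\gamma/2)\gamma = \omega^2 + \gamma^2/4$, and it can be confirmed numerically with a short Python sketch. The Drude parameters are those quoted in the Fig. 1 caption; reading them as angular quantities ($\omega_p = 34\pi$ and $\gamma = 0.854$ in rad/ns) and taking the operating point at $2\pi \times 6.68$ GHz are assumptions of this sketch.

```python
import numpy as np

def drude_permittivity(omega, omega_p, gamma):
    """Drude model: eps = 1 - omega_p^2 / (omega^2 + i*omega*gamma)."""
    return 1.0 - omega_p**2 / (omega**2 + 1j * omega * gamma)

omega_p = 34 * np.pi        # plasma frequency from the Fig. 1 caption
gamma = 0.854               # ohmic loss term from the Fig. 1 caption
omega = 2 * np.pi * 6.68    # assumed operating point (6.68 GHz as angular frequency)

eps_real_freq = drude_permittivity(omega, omega_p, gamma)
eps_complex_freq = drude_permittivity(omega - 0.5j * gamma, omega_p, gamma)
eps_closed_form = 1.0 - omega_p**2 / (omega**2 + gamma**2 / 4)

print(eps_real_freq)        # negative real part with a positive imaginary (loss) part
print(eps_complex_freq)     # imaginary part vanishes to rounding error
```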

A CFW with a negative imaginary part corresponds
to a wave with temporal attenuation.
A CFW is expressed mathematically as
$E(t) \sim e^{-i\tilde{\omega}t}$, where $t$ denotes time,
$\tilde{\omega} = \omega - i\tau/2$, and $\tau > 0$ is the temporal attenuation
factor. Although an ideal CFW exists
mathematically, it is unphysical because it
implies that the energy would diverge when
$t$ approaches negative infinity. Hence, a truncation
at the start of time needs to be implemented
to rationalize the CFW, that is,
$E_T(t) = E_0 e^{-i\tilde{\omega}t}\theta(t)$, where $\theta(t) = 0$ for $t < 0$
and $\theta(t) = 1$ for $t \geq 0$. Using Fourier transformation,
the truncated CFW can be expanded
into the integration of the spectral components

$$E_T(t) = \frac{E_0}{2\pi} \int_{-\infty}^{+\infty} \frac{1}{i(\tilde{\omega} - \omega')}\, e^{-i\omega' t}\, d\omega',$$

where the integration is from $-\infty$ to $\infty$ and the Fourier coefficient
$1/[i(\tilde{\omega} - \omega')]$ has a Lorentzian lineshape
[for details, see supplementary materials (SM)
section 6]. Hence, any response of the system
at a complex frequency $\tilde{\omega}$, including the
dielectric constant $\varepsilon$ and the transfer function
$T$, can be obtained via the integral of the
corresponding real-frequency response, as

$$F(\tilde{\omega}) \approx \int_{-\infty}^{+\infty} F(\omega')\, \frac{1}{i(\tilde{\omega} - \omega')}\, e^{i\tilde{\omega}t - i\omega' t}\, \frac{d\omega'}{2\pi}$$

for sufficiently long duration $t$. In practice, it is sufficient
to choose discrete frequency points at a
certain frequency interval $\Delta\omega$ to synthesize
the signal. Further, based on the compressed
sensing theory (36-38), the CFW can be synthesized
based on the information taken from
a finite spectrum range. If the spectrum range
is broad enough, the noise of the interference
between different harmonics is suppressed.
The system response under synthesized CFW
excitation can be discretized as

$$F(\tilde{\omega}) \approx \sum_n F(\omega_n)\, \frac{1}{i(\tilde{\omega} - \omega_n)}\, e^{i\tilde{\omega}t - i\omega_n t}\, \frac{\Delta\omega}{2\pi} \qquad (1)$$

Owing to the frequency being discretized, the
signal has an overall $2\pi/\Delta\omega$ temporal periodicity.
Feeding the permittivity at a number of
frequencies (the black and gray circles in the
left panel of Fig. 1A) into Eq. 1, where $F$ represents
the permittivity, we can obtain the
synthesized permittivity of complex frequency
with $\tau = \gamma$, as shown in the right panel of Fig. 1A,
which clearly shows that the loss of the Drude
model can be largely compensated by virtual
gain (for details, see SM section 7).

We use the synthetic CFW to study the imaging
performance in a metal-dielectric multilayer
lens with a total of 15 layers (Fig. 1B),
Fig. 1. Illustration of loss compensation of the superimaging lens using synthetic CFWs. (A) The
permittivities of the Drude model with a finite damping term before (left) and after (right) loss compensation
using the CFWs synthesized with permittivities at multiple frequencies. The solid circles indicate that
permittivities at discretized frequencies can be used to synthesize the complex-frequency permittivity. Red,
green, and blue waves and dashed lines indicate wave components at the three different frequencies and the
corresponding real and imaginary parts of the permittivity. (B) The left panel shows a schematic of light
passing through a metal-dielectric multilayer hyperbolic lens from two closely placed slits at the bottom.
The thicknesses for the metal layer and the dielectric layer are $d_m$ = 0.7 mm and $d_d$ = 1.76 mm, respectively.
The permittivity of the metal is described by the Drude model, with $\omega_p = 34\pi$ GHz and $\gamma$ = 0.854 GHz. The
permittivity of the dielectric medium is $\varepsilon_d$ = 2.2. $E_{\mathrm{obj}}$ represents the field distribution at the top surface,
whereas the right panel shows schematically the field pattern $E_{\mathrm{PSF}}$ generated from a point source.
(C) The dispersion plots formed by Fourier transformation of $E_{\mathrm{obj}}$ and $E_{\mathrm{PSF}}$ at different frequencies. (D) The
complex-frequency dispersion plots with $\tau$ = 0.854 GHz derived by synthesizing the dispersions in (C)
following Eq. 1. A frequency range from 4 to 8 GHz is used to synthesize the complex-frequency results.
(E) The corresponding imaging results at a complex frequency of (6.68 − 0.068i) GHz and a real frequency of
6.68 GHz, as indicated by the red and blue dashed lines in (C) and (D), respectively. I, intensity.


which functions as a type II hyperbolic medium 
with mixed signs of permittivity tensor ele- 
ments along different directions (39). The left 
panel of Fig. 1B schematically shows the field 
emitted from two closely spaced slits of dif- 
ferent widths from the bottom of a flat hyper- 
bolic material. The waves transmitted to the 
upper interface form an electric field pattern 
$E_{\mathrm{obj}}(\omega, \mathbf{r})$, where $\mathbf{r}$ is the real-space coordinate. 
The corresponding distribution in the momen- 
tum space $E_{\mathrm{obj}}(\omega, \mathbf{k})$ can be derived by Fourier 
transformation, where $\mathbf{k}$ is the in-plane wave 
vector. Inspired by the time-reversal imaging 
technique that has been demonstrated in acous- 
tics and other wave systems (40-46), we used 
a postprocessing procedure to mimic a phase 
conjugation action to restore the image of the 
object. We first performed the complex conju- 
gate operation of the momentum space dis- 
tribution $E_{\mathrm{obj}}(\omega, \mathbf{k}) \rightarrow E^{*}_{\mathrm{obj}}(\omega, \mathbf{k})$ and then mul- 
tiplied it by the transfer function to obtain the 
image in the Fourier space, that is 

$$E_{\mathrm{imag}}(\omega, \mathbf{k}) = E^{*}_{\mathrm{obj}}(\omega, \mathbf{k})\, T(\omega, \mathbf{k}) \qquad (2)$$


In this equation, the transfer function is ob- 
tained by Fourier transforming the point spread 
function, $T(\omega, \mathbf{k}) = \mathcal{F}[E_{\mathrm{PSF}}(\omega, \mathbf{r})]$, where $E_{\mathrm{PSF}}(\omega, \mathbf{r})$ 
is the field emitted from a point source 
located on one side of the hyperbolic slab to 
the opposite surface, as shown in the right panel 
of Fig. 1B. The purpose of multiplying by the 
transfer function is to restore the original image 
of the object. Finally, the real-space image pat- 
tern $E_{\mathrm{imag}}(\omega, \mathbf{r})$ can be obtained by the inverse 
Fourier transformation of Eq. 2. 
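
The postprocessing chain around Eq. 2 can be sketched numerically. The snippet below is a minimal illustration and not the authors' code: it assumes that the object-plane field and the point-source field have already been sampled on the same uniform grid, and the function name `phase_conjugate_image` is hypothetical. With NumPy's FFT sign convention, conjugating the spectrum mirrors the pattern about the origin, so the restored image appears at the reflected coordinates.

```python
import numpy as np


def phase_conjugate_image(e_obj_r, e_psf_r):
    """Apply Eq. 2 to sampled fields (illustrative sketch).

    e_obj_r : 2D complex array, field of the object at the output surface.
    e_psf_r : 2D complex array, field of a point source at the same surface.
    Returns the complex real-space image on the same grid.
    """
    e_obj_k = np.fft.fft2(e_obj_r)        # E_obj(omega, k)
    transfer = np.fft.fft2(e_psf_r)       # T(omega, k) = F[E_PSF(omega, r)]
    e_imag_k = np.conj(e_obj_k) * transfer  # Eq. 2
    return np.fft.ifft2(e_imag_k)         # back to real space
```

If the object field is the object convolved with the point spread function, the product in Eq. 2 equals the conjugated object spectrum weighted by |T|². For an idealized transfer function of unit magnitude the phases cancel exactly; with loss, |T|² suppresses the large wave vectors, which is the limitation that the complex-frequency synthesis is meant to relax.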

We used finite element method (FEM) simu- 
lation to numerically calculate $E^{*}_{\mathrm{obj}}(\omega, \mathbf{k})$ and 
$T(\omega, \mathbf{k})$, which are displayed in the left and right 
panels of Fig. 1C, respectively. The calculated 
image pattern based on Eq. 2 at frequency f = 
6.68 GHz is shown by the red line in Fig. 1E, 
which deviates from the object (shown as the 
dashed line). By contrast, by performing phase 
conjugation in the complex-frequency domain 
(47, 48), that is, by constructing $E^{*}_{\mathrm{obj}}(\tilde{\omega}, \mathbf{k})$ 
and $T(\tilde{\omega}, \mathbf{k})$, respectively, using Eq. 1 with $\tau = \gamma$, 
the large wave vector components are recovered 
(Fig. 1D). The complex-frequency image faith- 
fully follows the original pattern, as shown by 
the blue line in Fig. 1E, verifying the capability 
of the multifrequency approach to synthesize 
the CFW for substantially enhancing the imag- 
ing resolution compared with that of the real 
frequency (red line in Fig. 1E). 
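
The multifrequency synthesis can be illustrated with a short numerical sketch. Eq. 1 is not reproduced in this excerpt, so the code below assumes the standard Fourier decomposition of a truncated, exponentially decaying wave with complex frequency ω̃ = ω − iτ/2; the normalization, the choice of snapshot time `t`, and the function name are assumptions and may differ from the authors' Eq. 1. The convention itself agrees with the Fig. 1 caption, where the imaginary part 0.068 GHz equals τ/(4π) for τ = 0.854 GHz.

```python
import numpy as np


def synthesize_complex_frequency(omegas, response, omega_c, tau, t):
    """Estimate a linear response at omega_c - 1j*tau/2 from real-frequency data.

    omegas   : 1D array of uniformly spaced real angular frequencies.
    response : complex array sampled at `omegas` along its first axis.
    omega_c  : real part of the target complex frequency.
    tau      : decay rate; the target frequency is omega_c - 1j*tau/2.
    t        : snapshot time, late enough for transients to decay but
               well below the temporal period 2*pi/d_omega of the sampling.
    """
    omegas = np.asarray(omegas, dtype=float)
    response = np.asarray(response, dtype=complex)
    omega_tilde = omega_c - 0.5j * tau
    d_omega = omegas[1] - omegas[0]
    detuning = omega_tilde - omegas
    # Spectrum of the truncated complex-frequency wave, with the carrier
    # exp(-1j*omega_tilde*t) divided out.
    kernel = np.exp(1j * detuning * t) / (1j * detuning)
    return np.tensordot(kernel, response, axes=(0, 0)) * d_omega / (2.0 * np.pi)
```

In this sketch the synthesized value converges only when the virtual gain τ stays below the damping rate of the sampled response; otherwise the transient term no longer decays relative to the complex-frequency carrier.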


Experimental results 
Microwaves 


Equations 1 and 2 require the distribution of 
both amplitude and phase of field, which can 
be readily obtained using a microwave charac- 
terization setup for a flat hyperbolic metama- 
terial lens designed to operate at microwave 
frequencies. The unit cell of the metamaterial 
consists of a spiral metallic wire-dielectric layer 
with the dimensions 4 mm by 4 mm by 1.5 mm 
to form a type II hyperbolic metamaterial with 
two identical in-plane negative permittivities 
and one out-plane positive permittivity, as 
shown in Fig. 2A. The spiral structure can reduce 
the plasma frequency such that the accessible 
wave vectors in the Brillouin zone can be much 
larger than those in the free space. The corre- 
sponding equifrequency contours (EFCs) at dif- 
ferent frequencies are retrieved using full-wave 
simulations, as shown in Fig. 2B, where the 
frequencies of the EFCs increase along the 
direction of the arrow. At higher frequencies 
and in the absence of loss, the EFC can reach 
the horizontal Brillouin edge, providing large 
in-plane wave vectors for achieving subdiffrac- 
tional imaging resolution. The corresponding 
in-plane EFCs are depicted in fig. S3. Figure 2C 
displays the experimental setup, wherein the 
bulk metamaterial lens is composed of 80-by- 
80 in-plane unit cells and 25 layers vertically. 
A dipole source is placed at the bottom of the 
sample, and a probe antenna is raster scanned 
at the top surface to measure the near-field 
distribution. The details of the experimental 
setup and samples are depicted in SM sections 
1 to 3. 

To begin, we scanned a one-dimensional (1D) 
line above the sample to measure the field 
distribution, as depicted in Fig. 2D, emitted 
from a single dipole source across 251 discrete 
frequency points within the range of 5 to 7.5 GHz. 
The dispersion is subsequently obtained by 
Fourier transformation (Fig. 2E). The disper- 
sion plot exhibits two bright lines in the middle, 
which lie near the light cone in the di- 
electric material, whereas the other bright lines 
Fig. 2. Experimental demonstration of loss compensation in superimaging 
using a hyperbolic lens at microwave frequency. (A and B) Photograph of 
hyperbolic metamaterial with metallic spiral wires of 0.2-mm width printed on 
a 1.5-mm-thick Teflon slab (A). The EFCs in the Brillouin zone at different 
frequencies are depicted in (B), with the frequencies distributed evenly between 
0.9 and 6.5 GHz. The Brillouin zone boundaries along the vertical and horizontal 
directions are π/(4 mm) and π/(1.5 mm), respectively. The arrow indicates the 
direction of increased frequency of the EFCs. (C) Schematic of the experimental 
setup with the two dipole antennas placed near the top and bottom interfaces 
of the sample, with the bottom and top antennas serving as source and probe, 


respectively. (D) The measured electric field distribution 


with larger wave vectors correspond to the 
hyperbolic modes. Limited by the damping 
of the system, the measured Fourier compo- 
nents in the momentum space are far from 
reaching the Brillouin zone edge. Because the 
frequency marked by the white dashed line 
(f = 6.5 GHz) exhibits relatively large wave vec- 
tors, it is selected as the central frequency for 
the signal synthesis. The 2D field distribution 
in the momentum space at the central frequency 
is shown in Fig. 2F, and the distributions at a 
number of other frequencies are provided in 
SM section 8. Using Eq. 1, we synthesized the 
dispersion of the complex frequency using 251 
frequency points, with the temporal snapshot 
captured at the end of the temporal periodicity 
(Fig. 2G). Notably, the synthesized results with 
complex-frequency excitation recover the 
field components across a large portion of 
the Brillouin zone, indicating a much better 
along a line from -150 to 


spatial resolution than that of the real fre- 
quency. Note that the Fourier space distri- 
butions shown in Fig. 2, F and G, correspond 
to the $T(\omega, \mathbf{k})$ and $T(\tilde{\omega}, \mathbf{k})$ of the real-frequency 
and complex-frequency cases in Eq. 2, 
respectively. 
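
The step from a measured line scan to a dispersion plot can be sketched as follows. This is an illustrative outline rather than the authors' processing script: the array layout (one row per frequency, one column per probe position), the sampling step `dx`, and the function name are assumptions, because the scan step is not stated in the text.

```python
import numpy as np


def dispersion_from_line_scan(field, dx):
    """Spatial Fourier transform of a multi-frequency line scan (sketch).

    field : complex array of shape (n_freq, n_x), amplitude and phase of the
            measured field at each frequency and probe position.
    dx    : probe step along the scan line (for example, in mm).
    Returns (k, spectrum): in-plane wave vectors in rad per unit of dx,
    sorted in ascending order, and the spectral magnitude (n_freq, n_x).
    """
    n_x = field.shape[1]
    spectrum = np.fft.fftshift(np.fft.fft(field, axis=1), axes=1)
    k = 2.0 * np.pi * np.fft.fftshift(np.fft.fftfreq(n_x, d=dx))
    return k, np.abs(spectrum)
```

Plotting `spectrum` against `k` and frequency gives a map of the kind shown in Fig. 2E. Both amplitude and phase are needed as input, which is why a phase-resolved measurement is required.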

We next showcase super-resolution imaging 
of subwavelength patterns with the synthesized 
complex frequency. The desired pattern is 
formed by sequentially placing a single emit- 
ting dipole antenna at different locations, and 
the field distribution at the top surface is mea- 
sured at each location of the dipole source. The 
measured field distributions are then linearly 
superimposed to form the image. Subsequently, 
Fourier transformation is performed to obtain 
the Fourier pattern at the real and complex 
frequencies, $E^{*}_{\mathrm{obj}}(\omega, \mathbf{k})$ and $E^{*}_{\mathrm{obj}}(\tilde{\omega}, \mathbf{k})$. The 
ground truth of three different patterns, letters 
H, K, and U, are shown in Fig. 2H, and the 
+150 mm at the top interface with a frequency step of 0.01 GHz, which lies in 
the same y-z plane as the source antenna. (E) The dispersion plot obtained by 
spatial Fourier transformation of the field distribution shown in (D). (F) The 2D 
electric field distribution in the momentum space at a frequency of 6.5 GHz, 
which serves as the transfer function. (G) The 2D electric field distribution in the 
momentum space for a CFW with a frequency of (6.5 − 0.0065i) GHz synthesized by 
the measurements at multiple frequencies. (H) The ground truth of the letters 

H, K, and U. (I to K) The imaging results with the dipole sources arranged in the 
shape of the three letters. The top and bottom subpanels correspond to the Fourier 
space field distributions and real-space intensity distributions, and the left and right 
subpanels show the complex-frequency and real-frequency results, respectively. 


corresponding imaging results obtained by 
using Eq. 2 are displayed in Fig. 2, I to K, 
respectively. The top-left image in each sub- 
panel shows the Fourier pattern with synthe- 
sized complex-frequency excitation, which 
occupies a substantial portion of the Brillouin 
zone, whereas the corresponding bottom- 
left image of each subpanel displays the image 
of the subwavelength letters. By contrast, the 
imaging results at the real frequency are shown 
in the right subpanels for all three cases, and, 
in each case, the original pattern cannot be 
identified, thereby confirming a substantial 
improvement in superimaging with the complex- 
frequency approach, from λ/2 to λ/6, where 
λ is the free-space wavelength at the central 
frequency (6.5 GHz) of the synthesized CFW. 
We next investigated the effect of the number 
of frequency points used to synthesize the complex- 
frequency response on the superimaging 
Fig. 3. Investigation of the dependence of the imaging quality on the number of frequency points. 
The ground truth is shown in the top-left panel, which consists of a five-by-five antenna array, where the 
lattice constants along the x and y directions are λ/4 and λ/6, respectively. The other panels show the image 
intensity distributions that were obtained using different numbers of frequency points, with the central 
frequency fixed at 6.5 GHz and a fixed frequency step of 0.01 GHz. The corresponding synthetic temporal 
signal is depicted in the inset of each panel. 


performance, with the results shown in Fig. 3. 
The top-left subpanel of the figure displays the 
object, which is composed of a five-by-five array 
of dipole antennas. The horizontal and vertical 
lattice constants are one-fourth and one-sixth 
of the central wavelength, respectively. We grad- 
ually reduced the number of frequency points 
from 251 to 1 to construct the synthetic imag- 
ing patterns, with a fixed frequency step of 
0.01 GHz, and the corresponding synthesized 
temporal signals are depicted in the insets. 
Our study shows that reducing the number of 
frequency points to 51 has a negligible impact 
on image quality. However, as we continued to 
reduce the number of frequency points, there 
was a substantial decrease in imaging reso- 
lution, causing the images of the dipoles to 
merge into vertical lines. When the number 
of frequency points drops below 17, these lines 
become wider in the horizontal direction. The 
corresponding Fourier distributions are shown 
in fig. S6. This highlights the importance of 
having a sufficient number of frequency points 
to maintain good spatial resolution and avoid 
image degradation. 


Infrared 


To showcase the versatility of the complex- 
frequency method, we used a silicon carbide 
(SiC) superlens operating at mid-infrared 
(mid-IR) frequency to investigate the loss 
compensation for superimaging. The design of 
the superlens is based on a SiC sandwich struc- 
ture (Fig. 4A), in which the upper and lower 
surfaces correspond to the image and object 
planes of the lens, respectively (6). The fabrica- 
tion technique can be found in SM sections 4 
and 5. The object is composed of a 1D grating of 
varying spacings or circular apertures of vary- 
ing diameters patterned on a metal film [scan- 
ning electron microscopy (SEM) image shown 
in the inset of Fig. 4B]. The optical image is 
captured by the technique of scattering-type 
scanning near-field optical microscopy (s-SNOM). 
With this technique, both the amplitude and 
phase of the image field can be collected by the 
probe. The measured field pattern of the 1D 
grating at the image plane is Fourier analyzed 
at different frequencies and presented in Fig. 
4B. Here, the frequency interval is 2 cm⁻¹, 
from 900 to 980 cm⁻¹, with a total of 41 points. 
Figure 4C displays the profiles of the grating 
captured by the SiC superlens at both the com- 
plex and real frequencies, in both real space 
(left panel) and momentum space (right panel). 
These profiles demonstrate superior imaging 
quality at the complex frequency compared 
with the real frequency. 

The SEM image of the array of 2D circular 
apertures is shown in Fig. 4D. The field dis- 
tributions in Fig. 4, E to G, correspond to the 
images captured at the complex frequency and 
two distinct real frequencies, respectively. The 
complex-frequency image closely resembles 
the SEM image, whereas only heavily blurred 
field patterns are observed at the real frequen- 
cies. To investigate the resolution limitations 
of the SiC superlens with loss compensation, 
two pairs of holes placed closely together, but 
with varying displacements, were fabricated; 
helium ion microscopy (HIM) images are shown 
in Fig. 4H. The corresponding s-SNOM images 
at the complex and real frequencies are shown 
in Fig. 4, I and J, respectively. Using the complex- 
frequency method, two pairs of circles with 
edge-to-edge separations of 40 and 100 nm, re- 
spectively, can be clearly resolved, whereas no 
discernible images are formed at the real fre- 
quency. The spatial resolution shown in the 
measurement of the double-aperture structure 
using complex frequency is around 400 nm, as 
shown in Fig. 4I, whereas the distance be- 
tween two neighboring field maxima at the cor- 
responding central frequency is around 1200 nm, 
as shown in Fig. 4G. 
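
For scale, the central wavenumber can be converted to a free-space wavelength and compared with the resolved feature size. The short calculation below is an added illustration, not part of the reported analysis; the function name is hypothetical, and the inputs are the 930 cm⁻¹ central frequency given in the Fig. 4 caption and the roughly 400 nm resolution quoted above.

```python
def wavenumber_to_wavelength_nm(wavenumber_per_cm):
    """Convert a wavenumber in cm^-1 to a free-space wavelength in nm."""
    # 1 cm = 1e7 nm, so wavelength [nm] = 1e7 / wavenumber [cm^-1].
    return 1.0e7 / wavenumber_per_cm


central_wavelength_nm = wavenumber_to_wavelength_nm(930.0)  # roughly 10.75 micrometers
resolution_nm = 400.0
# Free-space wavelength divided by the resolved feature size.
wavelength_over_resolution = central_wavelength_nm / resolution_nm
print(f"lambda = {central_wavelength_nm:.0f} nm, "
      f"lambda / resolution = {wavelength_over_resolution:.1f}")
```

Under these numbers the resolved separation corresponds to about 1/27 of the free-space wavelength at the central frequency.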


Concluding remarks 


We have presented a method to compensate 
for the intrinsic loss of hyperbolic metamate- 
rial and for SiC superimaging lenses by synthe- 
sizing a complex-frequency excitation through 
a multifrequency approach, which improves 
imaging resolution beyond the limit imposed 
by the damping of the system. Our approach 
successfully overcomes the challenges of experi- 
mentally implementing CFWs in the time do- 
main, which include the need for precise CFW 
synthesis and time-gated measurements after 
reaching the quasi-steady state, and holds great 
potential for high-resolution microscopy. Fur- 
thermore, the synthesized complex-frequency 
approach can be extended to other areas of 
optics, such as plasmonic sensing applications. 
By leveraging the enhanced quality factor of 
plasmonic structures, our approach has the 
potential to substantially improve sensitivity 
in sensing applications. In addition, the ap- 
proach can be tailored to different systems 
and geometries, providing a flexible and ver- 
satile tool for improving optical performance. 
Fig. 4. Experimental observation of loss compensation in mid-IR superimaging 
using a SiC superlens. (A) Schematic of the s-SNOM experimental setup. 
A ~440-nm-thick SiC layer is sandwiched between two SiO₂ layers of equal 
thickness (~220 nm). A 60-nm-thick gold grating is placed at the bottom. 
(B) Fourier distribution of a line scan at the imaging plane in the 1D grating 
structure with varying spacings (SEM image is shown in the inset; scale bar, 
2 μm). The dashed line represents the central frequency of 930 cm⁻¹. 
(C) Imaging patterns at both the complex frequency f = (930 − 12i) cm⁻¹ 
Human POT1 protects the telomeric ds-ss DNA 
junction by capping the 5' end of the chromosome 


Valerie M. Tesmer, Kirsten A. Brenner, Jayakrishnan Nandakumar* 


Protection of telomeres 1 (POT1) is the 3' single-stranded overhang-binding telomeric protein that 
prevents an ataxia telangiectasia and Rad3-related (ATR) DNA damage response (DDR) at chromosome 
ends. What precludes the DDR machinery from accessing the telomeric double-stranded-single-stranded 
junction is unknown. We demonstrate that human POT1 binds this junction by recognizing the 
phosphorylated 5' end of the chromosome. High-resolution crystallographic structures reveal that the 
junction is capped by POT1 through a “POT-hole” surface, the mutation of which compromises junction 
protection in vitro and telomeric 5'-end definition and DDR suppression in human cells. Whereas 
both mouse POT1 paralogs bind the single-stranded overhang, POT1a, not POT1b, contains a POT-hole 
and binds the junction, which explains POT1a’s sufficiency for end protection. Our study shifts the 
paradigm for DDR suppression at telomeres by highlighting the importance of protecting the 
double-stranded-single-stranded junction. 


Nucleoprotein complexes called telomeres 
cap chromosome ends to ensure genome 
integrity. Human telomeric DNA contains 
~10 to 15 kb of tandem 5’-GGTTAG-3'/3'- 
CCAATC-5’ repeats. Although telomeric 
DNA is primarily double-stranded (ds), all 
chromosomes terminate in a 50- to 500- 
nucleotide (nt) single-stranded (ss) G-rich 
telomeric overhang (Fig. 1A, bottom) (1). The 
six-protein shelterin complex coats telomeric 
DNA to protect chromosome ends from being 
recognized as dsDNA breaks by the ataxia 
telangiectasia and Rad3-related (ATR) kinase- 
and ataxia-telangiectasia mutated (ATM) 
kinase-mediated DNA damage response (DDR) 
machineries (2, 3). ATR signaling involves 
multiple protein factors and coordinated recog- 
nition of both the ss and the adjacent ds-ss 
junction of its DNA substrates (4). Protection 
of telomeres 1 (POT1) is a shelterin component 
that binds the ss G-rich overhang with high 
affinity and sequence specificity and prevents 
ATR signaling at telomeres (2, 5, 6). POT1 
recognizes ssDNA through its DNA bind- 
ing domain (DBD), which consists of two 
oligonucleotide/oligosaccharide-binding (OB) 
domains (Fig. 1A). Previous studies have re- 
ported a decanucleotide TTAGGGTTAG within 
two telomeric ss repeats, ¹GGTTAGGGTTAG¹², 
to be sufficient for high-affinity binding to hu- 
man POT1 (hPOT1) (7). The first OB domain 
(OB1) of hPOT1 binds ³TTAGGG⁸ (OB1ᴰᴺᴬ), 
whereas its second OB domain (OB2) binds 
⁹TTAG¹² (OB2ᴰᴺᴬ) (Fig. 1, A and B) (7). Homologs 
of POT1 are identifiable across eukaryotes 
(5, 8-15), and deleting the POT1 paralog in 
mice that is involved in chromosome-end pro- 
tection (POT1a) is embryonic lethal (14, 15). 
The current model for ATR suppression at 
telomeres invokes the prevention by POT1 of 
replication protein A (RPA) loading onto the ss 
overhang, through POT1’s high affinity for 
telomeric ssDNA and its tethering to the rest 
of shelterin at telomeric dsDNA (2, 6, 16). Yet, 
multiple observations suggest that addition- 
al features of POT1 are involved in ATR rep- 
ression. First, mouse POT1 paralogs POT1a 
and POT1b display indistinguishable ssDNA- 
binding activity, but only POT1a is sufficient for 
chromosome-end protection (6, 5, 17), whereas 
POT1b regulates chromosome-end processing 
and replication activities (16, 18-21). Replac- 
ing the DBD of POT1b with that of POT1a or 
hPOT1 enables ATR repression at telomeres 
(17). Second, replacing the DBD of POT1a with 
that of ssDNA-binding protein RPA70 is not 
sufficient to fully repress ATR signaling at 
telomeres in mouse cells that lack POT1a 
(16). Moreover, POT1’s binding to the G-rich 
ss overhang does not explain how it dictates 
the 5’ end of the C-rich strand, which terminates 
predominantly in ATC-5' in mammals (22, 23) 
(Fig. 1A, bottom). These observations are con- 
sistent with the DBD of hPOT1 and mouse POT1a 
carrying out an additional function relevant to 
ATR suppression. 


Results 
Human POT1 binds a 5’-phosphorylated 
telomeric ds-ss DNA junction 


We hypothesized that hPOT1 binds to the telo- 
meric ds-ss junction after we reanalyzed its pub- 
lished DNA binding-site preferences. Two classes 
of POT1 binding sites emerged from previous 
SELEX (systematic evolution of ligands by ex- 
ponential enrichment) analysis, one of which 
was the expected ³TTAGGGTTAG¹² (OB1ᴰᴺᴬ and 
OB2ᴰᴺᴬ) site (Fig. 1B, Class I) (24). A second class 
contained OB1ᴰᴺᴬ, an upstream tri-K (“K” indi- 
cates a G or T nucleotide), and a seemingly non- 
telomeric (NT) sequence implicated in binding 


18 August 2023 


suspected on the basis of the observation that 
the tri-K GGG motif corresponds to the telo- 
meric repeat sequence upstream of OB1ᴰᴺᴬ, 
and NT sequences in the Class II hits could fold 
into a hairpin (hp) containing a 2-base-pair (bp) 
stem ⁻¹GG⁰/⁻⁶CC⁻⁷ and a variable tetraloop 
(positions −5 to −2) (Fig. 1B and fig. S1, A and B). 
In this interpretation, G⁰ and C⁻⁷ represent the 
first base pair at the ds-ss junction (with C⁻⁷ 
corresponding to the 5’ end of the mammalian 
chromosome), and the 3’ overhang initiates in 
the GGTTAG register (Fig. 1, A and B, and fig. S1, 
A and B). We conducted a quantitative electro- 
phoretic mobility shift assay (EMSA) with puri- 
fied hPOT1 DBD (hDBD) (fig. S2A) and a 5'-³²P- 
labeled hp oligonucleotide derived from the 
Class II consensus terminating in a C at the 
5’ end and containing a 3’ overhang of sequence 
¹GGTTAGGG⁸ (hp-ss¹⁻⁸) (Fig. 1C). The absence 
of OB2ᴰᴺᴬ from the Class II consensus atten- 
uates the affinity of hDBD for ssDNA (7), allow- 
ing us to assess DNA affinity of POT1 for the 
ds-ss junction. hDBD bound strongly to hp-ss¹⁻⁸ 
[dissociation constant (Kd) = 2.6 ± 0.3 nanomolar 
(nM)] but not to a similar target (no_hp-ss¹⁻⁸) 
that lacks the ability to form a hairpin (Fig. 1, C 
and D). The natural telomeric ds-ss junction 
ends in a 5’-phosphate (5’-P), which has been 
previously exploited to determine the 5'-terminal 
nt of chromosomes by using DNA ligase- 
mediated methods (25). To test the importance 
of this phosphate in binding hPOT1, we per- 
formed a competition experiment mixing 
5'-³²P-hp-ss¹⁻⁸ with either nonradiolabeled 
5'-phosphorylated hp-ss¹⁻⁸ or 5'-OH-hp-ss¹⁻⁸ be- 
fore binding to hDBD. The 5’-P was required to 
effectively outcompete POT1 binding to the 
radiolabeled DNA (Fig. 1E). The absence of a 
5'-P at the DNA junction in past in vitro 
studies may have prevented the detection of 
this previously unappreciated POT1 DNA- 
binding activity (17, 26-29). POT1 bound to 
a telomeric ds-ss junction in vivo is poised to 
engage both OB1ᴰᴺᴬ and OB2ᴰᴺᴬ. Extension of 
the overhang of the hp to include OB2ᴰᴺᴬ (hp- 
ss¹⁻¹²) resulted in a higher affinity for hDBD 
[Kd = 70 picomolar (pM)] (Fig. 1, C and F) 
compared with either ss¹⁻¹² (Kd = 190 pM) 
(Fig. 1C and fig. S2B) or hp-ss¹⁻⁸ (Fig. 1, C and D). 
We confirmed that a heterodimer of full-length 
hPOT1 and shelterin partner TINT1-PTOP-PIP1 
(TPP1) (by using the TPP1N truncation con- 
struct), which approximates the context of 
hPOT1 coating the ss overhang in vivo (30, 31), 
exhibited robust binding to 5’-P-hp-ss¹⁻¹² 
(Fig. 1G). hDBD bound a two-stranded DNA 
(duplex reinforced with 30 bp of arbitrary, 
nontelomeric sequence) terminating in 5 bp of 
native ds telomeric junction sequence and an 
8-nt overhang (long_ds-ss¹⁻⁸) with an affinity 
that was approximately one order of magni- 
tude greater than observed with hp-ss¹⁻⁸, likely 
Fig. 1. Human POT1 recognizes the 5’-phosphorylated 
ds-ss junction of telomeres. (A) (Top) Schematic 
of hPOT1 includes binding domains for ssDNA 
(hDBD) and TPP1 (TPP1-BD). hDBD (PDB: 1XJV) is 
composed of OB1 and OB2. The current model 
suggests that POT1 outcompetes the ssDNA-binding 
RPA complex to prevent ATR signaling at telomeres. 
HJRL, Holliday junction resolvase-like. (Bottom) 
Mammalian chromosomes end in a ds-ss junction 
containing ATC-5' (predominantly) and a ss 
G-rich overhang. Numbering starts with the first 
overhang nucleotide. (B) A previous SELEX study 
revealed two hPOT1-binding DNA classes (24). 
Class I harbors the known sites for OB1 and OB2, 
denoted as OB1ᴰᴺᴬ (cyan) and OB2ᴰᴺᴬ (pink), 
respectively. Class II revealed a consensus containing 
a seemingly nontelomeric (NT) sequence upstream of 
OB1ᴰᴺᴬ that can potentially fold into a hp; “K” 
indicates a G or T nucleotide, and the shaded area 
indicates the sequence of the first bp at the telomeric 
ds-ss junction. (C) Annotated name, sequence, 
predicted hp structure [with Tm (where Tm is the 
temperature at which 50% of dsDNA is denatured) 
calculated by the UNAFold web server], and mean Kd 
and SD (of binding to hDBD) of the oligonucleotides 
used in EMSA analysis. NA indicates not applicable. 
(D to H) EMSA of indicated proteins (hDBD or POT1- 
TPP1N heterodimer) and 5'-³²P-labeled DNA oligo- 
nucleotides. (D), (F), (G), and (H) indicate direct 
binding experiments, and (E) indicates a competition 
experiment. In (D), 0.1 nM 5'-³²P-hp-ss¹⁻⁸ was used; 
the number of experimental replicates n = 5 for 
hp-ss¹⁻⁸ (full and partial titrations); n = 3 for no_hp-ss¹⁻⁸. 
In (E), 100 nM hDBD and 0.1 nM 5'-³²P-labeled hp-ss¹⁻⁸ 
were incubated with indicated amounts of unlabeled 
hp-ss¹⁻⁸ (cold DNA) containing either a 5'-OH or a 5'-P; 
n = 3. In (F), 0.001 nM 5'-³²P-hp-ss¹⁻¹² was used; n = 3. 
In (G), 0.01 nM 5'-³²P-hp-ss¹⁻¹² was used; n = 3. In (H), 
0.001 nM 5'-³²P-long_ds-ss¹⁻⁸ was used; n = 3. Circled 
red “P” indicates radiolabeled; circled black “P” 
indicates nonradiolabeled. Bound (B) indicates DNA 
bound to protein; Free (F) indicates free, unbound DNA. 


reflecting the greater stability of the more phys- 
iologically representative duplex DNA versus 
that of the hp (Fig. 1, C and H). Our data dem- 
onstrate that the telomeric ds-ss junction is a 
previously unappreciated high-affinity binding 
site for hPOT1. 
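
To put the reported dissociation constants in perspective, the expected fraction of DNA bound can be estimated with a simple binding isotherm. The sketch below is an illustrative model and not the authors' fitting procedure: it assumes 1:1 equilibrium binding with the labeled DNA far below Kd, so that free protein is approximately equal to total protein. The function name and the descriptive substrate labels are hypothetical; the Kd values are those quoted above.

```python
def fraction_bound(protein_nM, kd_nM):
    """Fraction of DNA bound for 1:1 binding when DNA is far below Kd."""
    return protein_nM / (protein_nM + kd_nM)


# Dissociation constants quoted in the text, converted to nM.
kd_nM = {
    "hairpin junction, 8-nt overhang": 2.6,
    "hairpin junction, 12-nt overhang": 0.070,
    "single-stranded DNA only": 0.190,
}

protein_nM = 1.0  # arbitrary example concentration of hDBD
for substrate, kd in kd_nM.items():
    bound = fraction_bound(protein_nM, kd)
    print(f"{substrate}: {bound:.2f} bound at {protein_nM} nM protein")
```

In this model half of the DNA is bound when the protein concentration equals Kd, so a lower Kd shifts the titration midpoint to lower protein concentrations.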


High-resolution structures reveal how human 
POT1 caps the phosphorylated 5' end of a 
telomeric junction 


To determine the structural basis for hPOT1’s 
telomeric ds-ss junction-binding activity, we 
formed complexes of hDBD with two sub- 
strates that mimic the telomeric ds-ss junction, 
5'-P-ds-ss¹⁻¹² (DNA containing a 5-bp arbitrary, 
nontelomeric tether upstream of GTTAG/CAATC- 
5'-P native telomeric ds sequence extending into a 
12-nt 3’ overhang) (fig. S2, C and D) and 5’-P-hp- 
ss¹⁻¹² (Fig. 1C), and solved their structures 
using x-ray crystallography (Fig. 2, A and B). 
The hDBD-bound 5’-P-ds-ss¹⁻¹² and 5’-P-hp-ss¹⁻¹² 
structures were solved to 2.60- and 2.16-Å res- 
olution, respectively (table S1). Both struc- 
tures are similar to each other (fig. S3D) and 
recapitulate the previously reported hDBD-ss 
DNA-binding interface with minor differences 
(fig. S3, A to C, and E to J) (7). These structures 
reveal how hPOT1 binds the phosphorylated 5’ 
end of the telomeric ds-ss junction (Fig. 2). An 
electropositive pocket of four amino acids (Y9, 
R80, H82, and R83) in the hPOT1 OB1 domain 
that we name the “POT-hole” caps the 5'-P- 
cytidine nucleotide by means of a network of 
stacking and electrostatic interactions (Fig. 2, 
D to F, and fig. S4A). R83 acts as the linchpin 
by forming an ionic interaction with the 5’-P, 
Fig. 2. Structural basis of telomeric junction 
5’-end protection by human POT1. (A and B) 
Cartoon representation of high-resolution crystal 
structures of complexes of (A) hDBD with 5'-P-ds- 
ss¹⁻¹² and (B) 5'-P-hp-ss¹⁻¹², showing OB1 (cyan) 
and OB2 (pink) bound to DNA [gray, with the 
exception of the 5'-P, whose atoms are shown as 
spheres and in Corey-Pauling-Koltun (CPK) coloring]. 
A boxed schematic of the DNA is shown below the 
structure, with the ds sequence found naturally at 
the telomeric ds-ss junction shaded gray, the 5'-P in 
red, and residues in the G-rich 3' overhang colored 
to indicate binding by OB1 and OB2, respectively. 
(C) Cartoon and (E) electrostatic surface (blue 
is electropositive and red is electronegative) repre- 
sentations of the hDBD-5'-P-ds-ss¹⁻¹² structure shown 
in a view orthogonal to that in (A). The 5'-P occupies 
a pocket in POT1 that is complementary in shape 
and charge. Single-letter abbreviations for the amino 
acid residues are as follows: H, His; R, Arg; and 
Y, Tyr. (D) The POT-hole-DNA interface within the 
hDBD-5'-P-hp-ss¹⁻¹² structure is shown with POT-hole 
side chains (carbon in cyan) and the nucleotides 
(carbon in light gray) near the junction shown as 
sticks. A water molecule bridging hPOT1 R80 to 
the 5'-P is shown as an orange sphere. The dashed 
lines indicate H-bonds and ionic interactions, the 
double-headed arrow indicates stacking of the hPOT1 
R83 side chain with the 5'-C at the junction 
(numbered C0), and N indicates the N terminus of 
hDBD resolved in the crystal structure. (F) Interaction 
map of hDBD with the ds-ss junction. 


stacking against the 5’-cytosine base, and form- 
ing hydrogen bonds (H-bonds) with the ribose- 
ring oxygen of the 5’-cytidine nucleoside (5’-C) 
(Fig. 2, D and F, and fig. S4A). R83 also forms 
an H-bond with Y9, which along with H82 
interacts with the 5’-P. R80 forms a water- 
mediated H-bond with the 5’-P. We observed 
that the POT-hole is not optimally sized to 
accommodate a bulkier adenine (purine in- 
stead of a pyrimidine) or thymine (methyl group 
on the base) at the 5’ end because of steric clashes 
(fig. S5, A to C). Furthermore, the fixed distance 
between the POT-hole and the ssDNA-binding 
region of hPOT1 dictates the preference for the 
naturally occurring ATC-5’ versus alternative 
5'-C iterations: ATCC-5’ and ATCCC-5' (fig. S5D). 
[Fig. 2A panel title: Structure of hDBD with 5’-P-ds-ss¹⁻¹²] 

In addition to interactions involving the POT- 
hole, junction recognition is fortified by con- 
tacts made by the backbone amides of hPOT1 
amino acids 121 to 124 with the phosphodiester 
group penultimate to the 5’-C (Fig. 2F and fig. 
S4B), as well as S99 with G? (Fig. 2F and fig. 
S3, K and L). These data provide the structural 
basis for binding of the telomeric ds-ss junction 
by hPOT1. 


The POT-hole dictates telomeric DNA junction 
binding and inhibits DDR at telomeres 


We evaluated the importance of the POT-hole 
in binding the telomeric ds-ss junction in vitro 
using purified hDBD variants with alanine 
mutations at Y9, R80, H82, and R83 (fig. S6A). 
[Fig. 2B panel title: Structure of hDBD with 5’-P-hp-ss¹⁻¹². DNA schematic labels: ds-ss junction; ss G-rich overhang] 

We also engineered an R83E charge-reversal 
mutant to test the importance of the ionic R83- 
5’-P interaction. Alanine substitution of F62, a 
residue in hPOT1 OB1 that is indispensable 
for binding telomeric ssDNA (32), was included 
as a control to disrupt binding to both ssDNA 
and the ds-ss junction. In agreement with the 
structural data, little to no DNA binding was 
observed for any POT-hole mutant with the 
5'-P-hp-ss¹⁻⁸, even at concentrations 100-fold 
greater than the Kd with wild-type (WT) hDBD 
(Fig. 3A, left). By contrast, POT-hole mutants 
bound 5'-P-ss¹⁻¹² with an affinity similar to that 
of wild type (Fig. 3A, right, and fig. S6, B and C). 
F62A failed to bind either oligonucleotide, 
which is consistent with binding to OB1ᴰᴺᴬ 
Fig. 3. Separation-of-function POT-hole mutations abrogate ds-ss DNA 
junction binding in vitro and result in a DDR at human telomeres. 
(A) EMSA to detect direct binding of WT or indicated mutant hDBD constructs 
with 5'-³²P-hp-ss¹⁻⁸ (0.1 nM; lanes 1 to 22) and 5'-³²P-ss¹⁻¹² (0.1 nM; lanes 23 to 
30); n = 3. (B) Schematic conveying how POT-hole mutations would disrupt 
binding to the ds-ss junction but not coating of the ss overhang by POT1. 
(C) Scheme for deletion of endogenous POT1 and complementation with lentivirally 
transduced hPOT1-Myc to assess the ability of mutants to suppress TIF formation 
in a HEK 293E-based cell line (34). (D) TIF analysis of cell lines after 4-OHT and dox 
(1000 ng/ml; 25 ng/ml in “low dox” wild type) treatment as described in (C) performed 
with peptide nucleic acid fluorescent in situ hybridization (PNA-FISH) for telomeres 
(green) and immunofluorescence (IF) for Myc (hPOT1; cyan) and 53BP1 (red). 4',6- 
diamidino-2-phenylindole (DAPI) was used to stain the nucleus (blue). Overlap of 
the telomeric and 53BP1 foci (and Myc foci, if applicable) in the “Merge” panel 
indicates TIFs. (Inset) Magnified view of the boxed area within the image; arrowheads 
indicate TIFs. (E) Quantitation of TIF data of which (D) is representative. Mean 
and SD (n = 3 for all conditions except WT -dox, for which n = 5; each +dox set 
containing >75 nuclei and each -dox set containing >50 nuclei) for TIFs are plotted 
for the indicated cell lines. P values calculated with a two-tailed Student's t test 
for comparisons against WT +dox data are indicated above the bars. 
Fig. 4. Presence of the POT-hole dictates POT1 paralog choice for two-stranded 
chromosome-end protection in mice. (A) Human POT-hole residues are conserved in 
mouse POT1a but not mouse POT1b. (B) Electrostatic surface comparisons of 
hDBD (from hDBD-5'-P-ds-ss! structure) and POT1a and POT1b DBD (AlphaFold 
models), with the phosphorylated 5'-C of the hDBD-bound structure shown in 
sticks. (C and D) EMSA analysis of indicated mouse POT1a and POT1b DBD 
constructs with the indicated 5'-32P-labeled oligonucleotides [0.1 nM for (C) 
and (D), right; 0.01 nM for (D), left]; n = 3. (E) EMSA analysis of indicated 
human and mouse POT1 DBD constructs with 0.001 nM 5'-32P-long_ds-ss?® 
DNA; n = 3. (F) (Top left) Names and sequences of the two DNA 
oligonucleotides, hp-ss¹²⁴ and ss¹²⁴, used to evaluate 5'-end-binding preference. 
Both DNAs were labeled at the 5' end with 32P for EMSA analysis. (Bottom 
left) Three possible DNA-binding registers for the first DBD molecule are 
shown with the center-binding register precluding the binding of a second DBD 
molecule. (Right) EMSA analysis of POT1a DBD with hp-ss¹²⁴ (discrete slow-migrating 
band with increasing concentrations of protein; 2xB) and ss¹²⁴ (smeary band: 
mixture of B and 2xB), DNA at 0.1 nM; n = 3. (G) EMSA analysis of indicated POT1a 
DBD constructs with 0.1 nM hp-ss¹²⁴. YHR, triple mutant Y9S-H82Q-R83G; n = 3. 


Tesmer et al., Science 381, 771-778 (2023) 


being critical for both DNA-binding modes. 
These data highlight the importance of the POT-hole 
in 5'-end binding and provide separation-of-function 
mutants to test the importance 
of hPOT1's junction-binding activity in cells 
(Fig. 3B). 

Loss of POT1 binding at the 3' overhang results 
in telomere dysfunction-induced foci (TIF), 
which signify the recognition of telomeres by 
the DDR machinery (33). To determine the biological 
importance of the POT-hole binding to 
the telomeric junction, we used a previously 
described cell line in which POT1 can be deleted 
in an inducible fashion (POT1 KO) (34) to test 
the ability of transduced WT and mutant hPOT1 
Myc-tagged constructs to compensate for the 
loss of endogenous POT1 (materials and methods). 
Transduced cells were treated first with 
4-OHT to delete POT1 and then either treated 
with doxycycline (dox) to induce exogenous 
hPOT1 expression (“+dox”) or left untreated 
(“-dox”) (Fig. 3C). In the absence of dox, 
4-OHT treatment resulted in a robust TIF 
phenotype, characterized by colocalization of 
the DDR factor 53BP1 at telomeres (fig. S6, E 
and F). hPOT1 wild type and “low dox” wild 


type, but not hPOT1 F62A, suppressed TIFs 
(Fig. 3, D and E). POT-hole mutants Y9A, 
R83A, and R83E were defective in TIF sup- 
pression compared with wild type, with R83E 
being the most deleterious (Fig. 3, D and E). 
This trend emphasizes the importance of the 
ionic interaction between R83 and the 5’-P. 
Clones isolated from 6X-Myc-tagged hPOT1 WT, 
F62A, and R83E cell populations also reca- 
pitulated the TIF phenotypes (fig. S7, A to D). 
Furthermore, TIFs were smaller (Fig. 3D, inset) 
and less frequent (Fig. 3, D and E) in POT-hole 
mutant cells compared with those in F62A 
cells. This finding suggests that both junction- 
and ssDNA-binding activities of hPOT1 must 
be compromised to trigger a full DDR (see 
Discussion). Our results demonstrate that junction 
binding, which should involve a single 
POT1 molecule per chromosome end (Fig. 3B), 
is critical for chromosome-end protection. 


The POT-hole differentiates mouse POT1 
paralogs and enables POT1a to protect the 
telomeric junction 


Despite being strictly conserved in other mammalian 
POT1 homologs, including mouse POT1a, 
each of the four POT-hole amino acids is replaced 
with a structurally disparate residue in 
mouse POT1b (Fig. 4A and fig. S8A). By contrast, 
the residues used in ss DNA binding are conserved 
in all mammalian POT1 homologs, including 
POT1b (fig. S8A). Aligning AlphaFold 
predictions (35) of POT1a and POT1b DBDs 
with the junction-bound structure of hDBD 
illustrates that the shape and electropositive 
nature of the POT-hole are predicted to be lost 
in POT1b (Fig. 4B and fig. S8, B and C). We 
hypothesized that POT1a, but not POT1b, protects 
the 5' end at the junction. Indeed, POT1a- 


18 August 2023 


Fig. 5. Maintenance of the ATC-5’ end of chromo- 
somes by the POT-hole. (A) Schematic of the 
modified STELA technique for determining the 
chromosomal 5'-terminal nucleotide in human cell 
lines. Step 1: DNA ligation of genomic DNA 5'-P ends 
with telorettes ending in each of the six possible 
repeat registers at the 3'-OH ends. Step 2: PCR 
amplification of the ligation products performed with 
a forward primer (PCR-F) targeting the subtelomere 
of chromosome XpYp and a reverse primer (PCR-R) 
targeting a sequence shared by all telorettes. The 
products are visualized with Southern blot analysis 
performed with a 5'-32P-labeled XpYpB2 reverse 
primer. (B) STELA-based determination of the 
chromosomal 5'-terminal nucleotide in the HEK 
293E-based POT1 KO parental cell line (-4-OHT 
and +4-OHT) and hPOT1-Myc WT- or R83E- 
complemented clonal cell lines treated with both 
4-OHT and dox. (C) Quantitation of ATC-5' preference 
calculated as the ratio of the total band intensity in the 
primer 3 lane over the total intensity over all six 

lanes. Mean and SD for n = 4 replicates of which (B) is 
representative are plotted. P values were calculated 
with a two-tailed Student's t test for comparisons 
against parental -4-OHT data (for parental +4-OHT) or 
hPOT1-Myc WT clones (for hPOT1-Myc R83E clones). 
(D) (Left) TRF analysis of cell lines used in (B) 
performed first under native conditions with a 
5'-32P-labeled telomeric C-probe (CTAACC), to detect 
the ss G-rich overhang. (Right) TRF analysis after 
denaturing the DNA on the same gel and reprobing it to 
detect the total telomeric DNA signal; n = 1. (E) Model 
for ATR inhibition at telomeres by POT1. The ssDNA- 
binding of hPOT1 prevents the loading of RPA to curb 
ATR recruitment to the 3' overhang. Protection of the 
ds-ss junction by hPOT1 prevents loading of the 
9-1-1/Rad17-RFC clamp and clamp-loader complex and 
ATR activator TOPBP1. In mice, both POT1 paralogs 
coat the ss overhang, but only POT1a protects the ds-ss 
junction. The shelterin proteins protecting the telomeric 
dsDNA are expected to keep POT1-TPP1 tethered to 
the ss overhang, facilitated by protein-protein 
interactions and the conformational flexibility within 
the proteins (29) and the telomeric DNA. 


and POT1b-DBD proteins bound ss'™, but only 
POT1a DBD engaged a telomeric ds-ss junction 
with high affinity (Fig. 4, C to E, and fig. S8, D 
and E). POT1a replaced with POT1b residues in 
the POT-hole (except R80; fig. S8F legend 
explains rationale) retained affinity toward ss’ 
(Fig. 4C) but failed to bind the junction (Fig. 4D, 
right, and E, and fig. S8D). 

To measure junction-binding in the presence 
of multiple ss DNA-binding sites, we developed 
an EMSA-based “POT packing” assay with two 
DNA targets, each containing four telomeric 
ss repeats (24 nt) spanning three possible 
POT1-binding registers. The 5' and 3' registers 
are compatible with the packing of two POT1 
molecules, whereas binding to a central register 
precludes the loading of a second POT1 
(Fig. 4F, left). hp-ss¹²⁴ includes a ds-ss junction 
upstream of this ss region, whereas ss¹²⁴ does 
not. A fully packed 2:1 DBD-DNA complex 
would produce a sharp, slow-migrating band 
at higher DBD concentrations, whereas a mix 
of 2:1 and 1:1 complexes (of various binding 
registers) would generate a smear. POT1a DBD 
binding resulted in a sharp band for hp-ss¹²⁴ 
but not ss¹²⁴, suggesting that the protein packs 
preferentially against a ds-ss junction but that 
there is no end-binding bias to dissuade it from 
binding to the central site of ss¹²⁴ (Fig. 4F, 
right). POT1a POT-hole mutants R83G and 
triple mutant YHR lost the ability to pack at the 
junction (Fig. 4G), which is consistent with R83 
capping the 5' terminus (Fig. 2, D and F) and 
repressing TIFs (Fig. 3, D and E). hDBD and 
mouse POT1b DBD formed a discrete complex 
with not only hp-ss¹²⁴ but also ss¹²⁴, which is 
consistent with a 3'-end-binding preference 
(fig. S9, A and B) (7). Our results demonstrate 
that the POT-hole allows POT1a to preferentially 
bind the telomeric junction. 


The POT-hole helps maintain the 5'-end identity 
of human chromosomes 


Consistent with the structures we solved, the 
POT-holes of hDBD and mouse POT1a DBD protect 
the 5'-P end from 5' exonucleolytic action 
in vitro (fig. S10, A to F). We next asked whether 
the POT-hole helps maintain the 5'-terminal 
sequence of the chromosomes in cells. We used 
a modified single telomere length analysis 
(STELA) approach that uses ligation-mediated 
polymerase chain reaction (PCR) amplification 
to determine the abundance of each of the six 
possible chromosomal 5'-end permutations 
(Fig. 5A) (23). Genomic DNA extracted from the 
parental human embryonic kidney (HEK) 293E 
cell line displayed the expected ATC-5' preference 
that is lost after POT1 deletion (Fig. 5, B 
and C). WT hPOT1, but not R83E hPOT1, was 
able to restore the ATC-5' bias to untreated 
(parental -4-OHT) levels, demonstrating that 
the POT-hole helps maintain the 5' end of the 
human chromosome (Fig. 5, B and C, and fig. 
S10G). The 5'-end scrambling of hPOT1 R83E 
was less severe than that of POT1 KO. This difference 
may be explained by the unleashing of 5' 
exonuclease activity at telomeres completely 
devoid of POT1 (36). Terminal restriction fragment 
(TRF) analysis reproduced previously 
characterized phenotypes (15, 34, 37), including 
the accumulation of slow-migrating species 
(denatured and native blots) and an increase 
in the G-rich ss signal (native blot) upon POT1 
deletion, which were suppressed by expression 
of hPOT1 wild type but not F62A (Fig. 5D and 
fig. S10H). R83E recapitulated the WT phenotypes, 
suggesting that the end-protection function 
of the POT-hole is separable from hPOT1's 
role in bulk-telomere or overhang-length maintenance. 
Thus, the POT-hole helps maintain 
ATC-5' ends without acutely influencing telomere 
length. 
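
The ATC-5' preference reported for the STELA experiment (Fig. 5C) is described as the total band intensity in the primer 3 lane divided by the total intensity over all six lanes. The following is a minimal sketch of that ratio, assuming band intensities have already been integrated per lane. The function name `atc5_preference` and the example intensities are hypothetical and are not taken from the study.

```python
def atc5_preference(lane_intensities, atc_lane=3):
    """Fraction of the total signal found in the lane reporting the ATC-5' end.

    lane_intensities: six integrated band intensities, one per telorette
    (lanes numbered 1 to 6). atc_lane=3 follows the "primer 3 lane" wording
    of the figure legend.
    """
    if len(lane_intensities) != 6:
        raise ValueError("expected one intensity per telorette (six lanes)")
    total = sum(lane_intensities)
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return lane_intensities[atc_lane - 1] / total


# Hypothetical intensities (arbitrary units), for illustration only.
biased = [5.0, 8.0, 70.0, 6.0, 6.0, 5.0]         # strong ATC-5' bias
scrambled = [15.0, 18.0, 20.0, 16.0, 16.0, 15.0]  # bias largely lost

print(atc5_preference(biased))
print(atc5_preference(scrambled))
```

A value near 1/6 would correspond to no preference among the six possible 5' ends under this definition.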


Discussion 


The major pathway of ATR activation requires 
RPA binding to exposed ssDNA and recogni- 
tion of the ds-ss junction by the 9-1-1/Rad17-RFC 
(RAD9-RAD1-HUS1/Rad17-RFC2-RFC3-RFC4- 
RFC5) clamp and clamp loader, which with the 
MRN (MRE11-RAD50-NBS1) complex recruit 
TOPBP1 (DNA topoisomerase 2-binding pro- 
tein 1) to activate ATR (Fig. 5E) (4, 38). The struc- 
ture of human 9-1-1/Rad17-RFC bound to a ds-ss 
junction revealed a basic pocket in Rad17 that 
is poised to bind the 5’-phosphorylated end of 
a junction by using a mechanism similar to 
that of POT1 (fig. S11, A and B) (39). Con- 
sistent with a competition between POT1 
and 9-1-1/Rad17-RFC in binding the ds-ss 
junction, subunits of the 9-1-1 and MRN com- 
plexes, as well as TOPBP1, are enriched at 
telomeres in the absence of hPOT1 (34). We 
therefore propose that POT1 not only out- 
competes RPA at the telomeric ss overhang 
but also prevents ATR activation by denying 
9-1-1/Rad17-RFC access to the telomeric ds-ss 
junction (Fig. 5E). 

The duplication of POT1 (40), the conservation 
of the POT-hole in POT1a (fig. S12A), the 
disruption of the POT-hole in POT1b (fig. S12B), 
and the retention of CTC1-STN1-TEN1 (CST)-binding 
motifs in POT1b (40) within the Muridae 
and Cricetidae families of the Rodentia 
order provide support to the hypothesis that 
POT1b relinquished junction binding to facilitate 
processes at the 3' end. We propose that 
POT1a wards off 9-1-1/Rad17-RFC at the junction, 
although both POT1a and POT1b paralogs 
could counter RPA at the overhang in mouse 
cells (Fig. 5E). 

The POT-hole is conserved in species distant 
to mammals, such as Sterkiella nova and 
Caenorhabditis elegans (fig. S13A). The precisely 
defined S. nova macronuclear telomere contains 
a 5'-C at the ds-ss junction and a 16-nt overhang 
that binds one telomere end-binding protein 
(TEBP)α/β complex (homologous to the POT1-TPP1 
complex) (41). TEBPα has been crystallized 
with a sulfate ion bound in a manner 
indistinguishable from how the 5'-P binds hDBD 
in our junction-bound structures (fig. S13B) (42). 
Indeed, like hPOT1, TEBPα binds the telomeric 
ds-ss junction more strongly than it binds telomeric 
ssDNA (8). These observations point to a 
single TEBPα/β complex simultaneously protecting 
the 5' and 3' ends of the chromosome 
(8, 41, 42). Schizosaccharomyces pombe, in which 
a POT-hole is not obvious (fig. S13, A and C) 
(5, 43), and eukaryotes whose chromosomes do 
not end in a 5'-C, must have evolved alternative 
approaches for junction protection. 
We updated the model for how telomeres 
avert detection by the DDR machinery to include 
a critical role of POT1 in binding the telomeric 
ds-ss junction. Thus, POT1 protects both DNA 
strands at human chromosome ends by coating 
the G-rich ss overhang and recognizing the phosphorylated 
5' end of the C-rich strand. 
Ergodicity, the central tenet of statistical mechanics, requires an isolated system to explore all available 
phase space constrained by energy and symmetry. Mechanisms for violating ergodicity are of interest 
for probing nonequilibrium matter and protecting quantum coherence in complex systems. Polyatomic 
molecules have long served as a platform for probing ergodicity breaking in vibrational energy transport. 
Here, we report the observation of rotational ergodicity breaking in an unprecedentedly large molecule, 
¹²C₆₀, determined from its icosahedral rovibrational fine structure. The ergodicity breaking occurs 
well below the vibrational ergodicity threshold and exhibits multiple transitions between ergodic and 
nonergodic regimes with increasing angular momentum. These peculiar dynamics result from the 
molecule's distinctive combination of symmetry, size, and rigidity, highlighting its relevance to emergent 
phenomena in mesoscopic quantum systems. 


Isolated systems that break ergodicity have 
been explored in a variety of experimental 
settings, including spin glasses (1), ultracold 
neutral atoms (2, 3) and ions (4), and 
photonic crystals (5). These systems exhibit 
ergodicity breaking of spin configurations 
and momentum or spatial distributions. 
By contrast, gas-phase polyatomic molecules 
provide opportunities to probe the ergodicity 
breaking of collective (rotational and vibrational) 
excitations in a finite quantum system. 
In this context, a topic of major interest has 
been the transport of energy deposited into 
molecular vibrations by optical pumping or 
collisions. Intramolecular vibrational redistribution 
(IVR) sets in once a critical threshold, 
defined by the product of vibrational 
coupling and the local density of vibrational 
states (which has a power-law scaling with 
vibrational energy), is exceeded (6-11). This 
vibrational ergodicity transition has been 
studied vigorously in the context of understanding 
and controlling unimolecular reaction 
dynamics (12). 

Among polyatomic molecules, buckminsterfullerene 
(¹²C₆₀) is notable for its structural 
rigidity and high degree of symmetry, which 
suppress IVR and allow for spectroscopic resolution 
(13) and optical pumping (14) of individual 
rovibrational states, an unusual and 
fortuitous situation for a molecule with 174 
vibrational modes. Its small rotational constant 
and stiff, cage-like structure ensure that 
hundreds of rotational states are populated 
even when vibrational excitations are largely 
frozen out, which can be achieved with modest 
buffer gas cooling to ~120 K. Thus, a thermal 
ensemble of ¹²C₆₀ can reveal extensive, 
state-resolved rotational perturbations spanning 
hundreds of rotational quanta by eliminating 
vibrational “hot bands.” 

First observed and understood in atomic 
nuclei (15-22), rotational perturbations can 
arise from spherical symmetry breaking in the 
frame fixed to a rotating self-bound deformable 
body (23), which lifts the degeneracy of 
different body-fixed projections of the total 
angular momentum vector J. Such perturbations, 
also called “tensor interactions” because 
of their anisotropic nature, manifest in fine-structure 
splitting of the total angular momentum 
(J) multiplets in rovibrational spectra and 
encode rich dynamics such as rotational bifurcations 
(18, 24), as previously observed in 
tetrahedral SnH₄, CD₄, CF₄, SiH₄, and SiF₄ and 
octahedral SF₆ molecules (25-33). Nevertheless, 
observing icosahedral tensor interactions, first 
predicted for C₆₀ over three decades ago (34), 
has remained an elusive goal, because there 
are far fewer examples of icosahedral molecules, 
nonspherical interactions occur only at 
higher orders of interactions, and icosahedral 
molecules are necessarily larger than lower-symmetry 
spherical top molecules, implying a 
smaller rotational constant. 

In this work, we observed these icosahedral 
tensor interaction splittings, revealing rotational 
ergodicity transitions in C₆₀ at energies 
well below its IVR threshold (10). Specifically, 
as the molecule “spins up” to higher J, the 
dynamics of the angular momentum vector J 
in the molecule-fixed frame switches between 
ergodic and nonergodic regimes. In the nonergodic 
regime, distinct vector J trajectories 
exist in the same energy range but remain 
separated by energy barriers. In the limit of 
high J, the tunneling between these trajectories 
is too weak to restore ergodicity, leaving 
a characteristic signature in the fine-structure 
level statistics. This phenomenon differs from 
IVR in three key respects: (i) It involves the 
“transport” of the molecule frame orientation 
of vector J instead of vibrational energy; (ii) it 
can occur well below the IVR threshold; and 
(iii) it switches back and forth multiple times 
between ergodic (described by a 6th rank tensor 
interaction) and nonergodic (described by 
a 10th rank tensor interaction) regimes as the 
angular momentum is varied. This peculiar 
dynamical behavior arises from multiple 
avoided crossings with other vibrational states, 
which induce nonmonotonic variations in the 
molecule's anisotropic character as J is varied. 
The rotational ergodicity transitions bear some 
similarity to those studied in asymmetric top 
molecules in a static electric field (35, 36) in 
that both concern the transport of angular 
momentum in the molecule frame. However, 
unlike in (35, 36), the rotational ergodicity 
transitions in C₆₀ are induced by intramolecular 
rovibrational coupling in the freely rotating 
molecule, rather than by an externally 
applied electric field. 


Effective ¹²C₆₀ rovibrational Hamiltonian 


The rovibrational structure of C₆₀ can be described 
by a field-free molecular Hamiltonian 

$$\hat{H} = \hat{H}_{\mathrm{scalar}} + \hat{H}_{\mathrm{tensor}} \qquad (1)$$

The scalar Hamiltonian $\hat{H}_{\mathrm{scalar}}$ contains only 
those combinations of vector J and vibrational 
angular momentum ℓ that preserve their spherical 
degeneracy (37) 

$$\hat{H}_{\mathrm{scalar}} = \nu_0 + B\mathbf{J}^2 + D\mathbf{J}^4 + \cdots - 2B\zeta\,\mathbf{J}\cdot\boldsymbol{\ell} \qquad (2)$$

where ν₀ is the vibrational band origin, B is 
the rotational constant, D is the scalar centrifugal 
distortion constant, and ζ is the Coriolis 
coupling constant. 

Rovibrational fine structure is encoded in the 
tensor Hamiltonian $\hat{H}_{\mathrm{tensor}}$. For simplicity, we 
considered a pure rotational tensor Hamiltonian 
consisting of the two lowest-order “icosahedral 
invariants.” These invariants are linear 
combinations of spherical tensors of the same 
rank that transform according to the totally 
symmetric irreducible representation in the 
icosahedral point group (I_h) (38). They can be 
expressed (39) in the basis of spherical harmonics 
$Y_k^q(\theta, \phi)$ of degree k and order q, which 
depend explicitly on the molecular frame's 
polar θ and azimuthal φ angles (Fig. 1A). The 
first two nontrivial (anisotropic) icosahedral 
invariants, with ranks 6 and 10, are given by 

$$T^{[6]}(\theta,\phi) = \frac{\sqrt{11}}{5}\,Y_6^{0}(\theta,\phi) + \frac{\sqrt{7}}{5}\left(Y_6^{5}(\theta,\phi) - Y_6^{-5}(\theta,\phi)\right) \qquad (3)$$

$$T^{[10]}(\theta,\phi) = \frac{\sqrt{3\cdot 13\cdot 19}}{75}\,Y_{10}^{0}(\theta,\phi) - \frac{\sqrt{11\cdot 19}}{25}\left(Y_{10}^{5}(\theta,\phi) - Y_{10}^{-5}(\theta,\phi)\right) + \frac{\sqrt{3\cdot 11\cdot 17}}{75}\left(Y_{10}^{10}(\theta,\phi) + Y_{10}^{-10}(\theta,\phi)\right) \qquad (4)$$

which are combined to construct a truncated 
tensor Hamiltonian 

$$\hat{H}_{\mathrm{tensor}}^{[6+10]}(\nu) = \gamma\left(\cos\nu\,T^{[6]} + \sin\nu\,T^{[10]}\right) \qquad (5)$$

This Hamiltonian is parameterized by an 
overall scaling factor γ and mixing angle ν such 
that ν = 0 and ν = π/2 correspond to pure $T^{[6]}$ 
and pure $T^{[10]}$, respectively. 

All operators that are incompatible with 
icosahedral point group symmetry, including 
any spherical harmonics of rank 1 to 5, 7 to 9, 
11, 13, 14,... (40), vanish from the Hamiltonian. 
The use of full rovibrational tensor operators 
is unlikely to change the picture qualitatively, 
particularly when J (~100 to 300) is much 
greater than the vibrational angular momentum 
quantum number ℓ = 1 (38). These polyhedral 
invariants are similar to those used to describe 
the crystal field splitting of electronic orbitals 
owing to an external lattice environment (41) 
or the ligand field splitting in transition-metal 
complexes (42). 

The energetic correction, or tensor energy 
defect, associated with orienting J in different 
directions in the molecule frame can be visualized 
by the altitude of a semi-classical “rotational 
energy surface” (RES), defined at a fixed 
J. Various possible icosahedral RESs, defined 
by their radii $r(\theta, \phi) = 1 + \hat{H}_{\mathrm{tensor}}^{[6+10]}(\nu; \theta, \phi)/2$, 
corresponding to different mixing angles ν, are 
plotted for J = 174 in the top panels of Fig. 1, B 
to F. Stationary points always lie on C₅, C₃, or 
C₂ rotational symmetry axes. However, as ν 
varies, they change in character between minima, 
maxima, and saddle points. The RES dictates 
the dynamics of J in the molecule frame 
(30, 43-46), analogous to how an adiabatic 
potential energy surface steers the relative 
motions of nuclei (47). During free evolution, 
the trajectory of J follows an equipotential 
contour of the RES. 

Fig. 1. Rotational energy surfaces and eigenvalues corresponding to icosahedral 
invariant spherical tensors. (A) Symmetries of C₆₀. (Left to right) (1) Ball-and-stick 
model of C₆₀, with the three different types of rotational symmetry axes 
that label stationary points on the rotational energy surface (RES). The degeneracies 
of the stationary points are listed in parentheses. Color and plot marker coding 
for each type of rotational symmetry axis are shown. (2) Body-fixed coordinates: 
polar θ and azimuthal φ angles. (3-5) View along C₅, C₃, and C₂ rotational 
symmetry axes. (B to F) (Top panels) RESs, defined by their radii 
$r(\theta, \phi) = 1 + \hat{H}_{\mathrm{tensor}}^{[6+10]}(\nu; \theta, \phi)/2$, for ν ranging from 0 to π. 
Eigenvalues of $1 + \hat{H}_{\mathrm{tensor}}^{[6+10]}(\nu)/2$ 
calculated for the fully symmetrized J = 174 subspace are plotted on the surface 
as radial contours, colored corresponding to their dominant rotational symmetry 
character. (Center panels) Tensor energy defects [eigenvalues of $\hat{H}_{\mathrm{tensor}}^{[6+10]}(\nu)$] over a 
range of J. The gray vertical line highlights J = 174. Eigenvalues are plotted using 
the marker corresponding to their dominant C₅, C₃, or C₂ character (A). Near 
the separatrices, the assignment is somewhat ambiguous owing to strong mixing. 
(Bottom panels) Distribution p(r) of energy gap ratios r (see text) calculated from 
tensor energy defect spectra aggregated over J = 0 to 400. Away from ν = 0, π [(C) 
to (E)], the finite value of p(r) as r → 0 is a signature of ergodicity breaking. 

In a full quantum-mechanical treatment, 
the perturbation $\hat{H}_{\mathrm{tensor}}^{[6+10]}$ leads only to discrete 
tensor energy defects ε_i given by the eigenvalues 
of $\hat{H}_{\mathrm{tensor}}^{[6+10]}$ in a fully symmetrized fixed-J 
subspace. These orbits trace out the closed 
contours on the RES in Fig. 1. The orbits may 
also be obtained directly from the RES: They 
are the trajectories that both (i) satisfy a Bohr 
quantization condition (44, 48) and (ii) transform 
according to the totally symmetric irreducible 
representation of the icosahedral point 
group (38). The latter condition accounts for the 
quantum indistinguishability of each bosonic 
nucleus in the ¹²C₆₀ isotopolog (13, 49) and 
is analogous to the selection of odd or even 
rotational states in ortho- or para-hydrogen 
molecules, respectively. The tensor energy defects 
are plotted for a range of J in the center 
panels of Fig. 1, B to F, and may be expected 
to appear in the rovibrational fine structure 
of C₆₀. 


Resolving ¹²C₆₀ rovibrational fine structure 


In the preceding discussions, we have focused 
solely on the geometric effects of icosahedral 
symmetry. In general, however, the measured 
tensor defect spectra may exhibit additional 
J-dependent scaling and offsets, which depend 
on intramolecular couplings specific to C₆₀. 
To resolve rovibrational perturbations in C₆₀, 
in this work, we explored the P-branch region 
of the 1185 cm⁻¹ T₁ᵤ(3) band, first identified 
as a region of potential interest in (13). Using 
cavity-enhanced continuous-wave (cw) spectroscopy 
with a quantum cascade laser (QCL) 
source, we achieved a minimum absorption 
sensitivity $\alpha_{\min} = 2.1 \times 10^{-10}\ \mathrm{cm^{-1}\,Hz^{-1/2}}$, or 
1000-fold better detection sensitivity per spectral 
element than in (13) ($\alpha_{\min} = 2.2 \times 10^{-7}\ \mathrm{cm^{-1}\,Hz^{-1/2}}$ 
per comb mode). We acquired 
600-MHz-wide absorption spectra by simultaneously 
scanning the QCL frequency and free 
spectral range of the enhancement cavity across 
molecular absorption lines, and recording the 
frequency-dependent absorption. These spectra 
were stitched together by a combination of 
direct calibration of the QCL frequency with 
a Fourier transform spectrometer and comparison 
to matching spectral features in the 
broadband, low signal-to-noise (SNR) frequency 
comb spectrum of (13). We obtained an absolute 
frequency accuracy of ~6 MHz throughout the 
entire measured frequency range, limited by 
the resolution of the reference frequency comb 
spectrum. Finally, to ensure consistency of 
the absorption signal over multiple days of 
data collection, we periodically remeasured 
the molecular absorption feature at 
R(J = 181) at ν = 1186.27 cm⁻¹ and normalized 
all data taken around the same time to its line 
strength and measured cavity finesse. 


Assigning ¹²C₆₀ rovibrational fine structure 


The culmination of these efforts is the infrared 
spectrum in Fig. 2, spanning the spectral region 
from 1182.0 to 1184.7 cm⁻¹. At J ≲ 70, there 
is a regular progression of rotational lines, 
similar to those in the R-branch (13). They 
rapidly split into intricate patterns before 
merging at and beyond J = 300. Zooming into 
the region labeled (B), the rotational line centers 
could be fit to the scalar part of Eq. 1, as 
was done in (13): 

$$\nu^{P}(J) = \nu_0 + B'(J - 1)(J + 2\zeta) - B''J(J + 1) \qquad (6)$$

where J here refers to the ground-state total 
angular momentum. The scalar centrifugal distortion 
term was not significant at our spectral 
precision and range of J. The fit yielded 
B″ = 0.0028 cm⁻¹ for the ground-state rotational 
constant, B′ = B″ − 2.876 × 10⁻⁷ cm⁻¹ 
for the excited-state rotational constant, ζ = 
−0.37538 for the Coriolis coupling constant, 
and ν₀ = 1184.85 cm⁻¹. Equation 6 yields a 
progression of rotational lines with a spacing 
of ~2B(1 − ζ), where B = (B′ + B″)/2. The 
spectroscopic constants were underdetermined 
and only served to facilitate J-assignment of 
the peaks in a manner consistent with the 
R-branch assignments of (13). The extrapolated 
P-branch spectral line positions based 
on this scalar fit are plotted as gray vertical 
lines in Fig. 2, B to F, with every fifth J value 
labeled in red. The agreement with the measured 
line positions is excellent in the region 
J < 70, where the spectrum appears unperturbed 
(Fig. 2B). 
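
The scalar fit can be evaluated numerically to see how the quoted constants produce a nearly uniform line spacing. The sketch below assumes Eq. 6 has the form ν^P(J) = ν₀ + B′(J − 1)(J + 2ζ) − B″J(J + 1), which is a reading of the printed equation, and uses the fitted constants quoted in the text. The function names are hypothetical.

```python
# Fitted constants quoted in the text (cm^-1; zeta is dimensionless).
NU0 = 1184.85
B_GROUND = 0.0028
B_EXCITED = B_GROUND - 2.876e-7
ZETA = -0.37538


def p_branch_position(j):
    """Scalar (unperturbed) P-branch line position for ground-state J = j.

    Assumed form of Eq. 6: nu0 + B'(J - 1)(J + 2*zeta) - B''J(J + 1).
    """
    return NU0 + B_EXCITED * (j - 1) * (j + 2 * ZETA) - B_GROUND * j * (j + 1)


def expected_spacing():
    """Approximate line spacing 2B(1 - zeta), with B the mean of B' and B''."""
    b_mean = 0.5 * (B_GROUND + B_EXCITED)
    return 2 * b_mean * (1 - ZETA)


if __name__ == "__main__":
    for j in (50, 51, 52):
        print(j, p_branch_position(j))
    print("spacing near J = 50:", p_branch_position(50) - p_branch_position(51))
    print("2B(1 - zeta):       ", expected_spacing())
```

Under these assumptions the spacing between adjacent lines stays close to 2B(1 − ζ) and grows slowly with J because B′ is slightly smaller than B″.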

To confirm our J assignment, we compared 
the peak absorption cross sections to the nuclear 
spin weights of the ground-state rotational 
levels and found that they match well. 
Finally, we applied a frequency-dependent 
scaling factor, $(2J + 1)^{-1}\,e^{BJ(J+1)/k_{\mathrm{B}}T}$, to the raw absorption spectrum. 
This scaling removes 
the effects of lab frame angular momentum 
degeneracy and the thermal ensemble, emphasizing 
the dynamics in the molecule-fixed 
frame. 
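
A scaling of this kind can be sketched as the inverse of the (2J + 1) lab-frame degeneracy multiplied by the inverse of a rigid-rotor Boltzmann factor. The snippet below is only an illustration under stated assumptions: the exact expression used by the authors is not fully legible in this copy, the rotational temperature of 120 K is taken from the buffer gas cooling value mentioned earlier and may differ from the actual rotational temperature, and the function name is hypothetical.

```python
import math

# Second radiation constant hc/k_B in cm*K, converting cm^-1 to kelvin.
HC_OVER_KB = 1.438777


def scaling_factor(j, b_ground=0.0028, temperature=120.0):
    """Inverse of the (2J+1) degeneracy times the Boltzmann factor exp(-E/kT).

    b_ground: ground-state rotational constant in cm^-1 (value from the text).
    temperature: assumed rotational temperature in kelvin (an assumption).
    """
    energy = b_ground * j * (j + 1)  # rigid-rotor energy in cm^-1
    return math.exp(HC_OVER_KB * energy / temperature) / (2 * j + 1)


if __name__ == "__main__":
    for j in (0, 100, 200, 300):
        print(j, scaling_factor(j))
```

Multiplying a measured line intensity at a given J by this factor would, under these assumptions, divide out the thermal population and the lab-frame degeneracy of that level.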

The peaks were identified manually and 
circled in blue. Figure 2, C and D, show two 
representative regions, at J ~ 90 and J ~ 170, 
respectively, where peaks could still be individually 
resolved. The local peak density again 
matches the predicted nuclear spin weights 
(in blue), confirming that the rovibrational 
fine-structure splitting originates from icosahedral 
tensor interactions of Eq. 5 that lift K-degeneracy. 
Figure 2, E and F, show two 
regions where the peaks have begun to merge, 
and individual peaks can no longer be easily 
identified (J > 247). 

Fig. 2. Direct continuous-wave (cw) absorption spectroscopy of C₆₀ 
P-branch. (A) Complete normalized cw spectrum of P-branch to expose 
the J dependence of the nuclear spin weights. Red highlighted regions are 
shown in greater detail in subsequent panels. (B to F) Enlargement of 
red highlighted regions in (A). (B) Enlargement of low-J region of the P-branch. 
This relatively unperturbed region of the P-branch is fit to a rigid-rotor 
Hamiltonian to obtain the rotational spacing 2B(1 − ζ). Fitted line positions 
are indicated by the gray vertical grid lines, with J labeled in red. Blue 
numbers denote calculated nuclear spin weights, which match the measured 
peak intensities well. In (C) and (D), peaks are resolvable and marked with 
blue circles. In (E) and (F), spectral congestion prohibits identification 

Interpreting the tensor splittings requires 
assigning each absorption peak to a partic- 
ular J. We began by assigning each peak to 
its nearest J value according to the scalar fit 
of Eq. 6. Subtracting the scalar contribution 
(Eq. 6) from the central frequency of each 
peak generated a single “period” of a Loomis- 
Wood-like defect plot in Fig. 3A. There remains 
some ambiguity in the defect assignments, as 
illustrated in the inset of Fig. 3A: Each defect is 
constrained to the line with slope 2B(1 − ζ), 
which passes through its current position. 
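
The defect assignment and its residual ambiguity can be sketched in a few lines. The example assumes the scalar line positions follow ν^P(J) = ν₀ + B′(J − 1)(J + 2ζ) − B″J(J + 1) with the constants quoted in the text; the function names and the test peak are hypothetical. It shows that reassigning a peak to a neighboring J shifts its defect by roughly 2B(1 − ζ), which is the discrete "move" available when unwrapping the defect plot.

```python
# Constants quoted in the text (cm^-1; zeta is dimensionless).
NU0, B_GROUND, ZETA = 1184.85, 0.0028, -0.37538
B_EXCITED = B_GROUND - 2.876e-7


def p_branch_position(j):
    """Assumed scalar P-branch position for ground-state J = j."""
    return NU0 + B_EXCITED * (j - 1) * (j + 2 * ZETA) - B_GROUND * j * (j + 1)


def nearest_j_and_defect(peak, j_max=400):
    """Assign a peak to the J whose scalar line is closest; return (J, defect)."""
    j_best = min(range(1, j_max + 1),
                 key=lambda j: abs(peak - p_branch_position(j)))
    return j_best, peak - p_branch_position(j_best)


def shift_assignment(peak, j, step):
    """Reassign the same peak to J + step and return the new (J, defect)."""
    new_j = j + step
    return new_j, peak - p_branch_position(new_j)


if __name__ == "__main__":
    # Hypothetical peak lying 0.002 cm^-1 above the scalar J = 120 line.
    peak = p_branch_position(120) + 0.002
    j, defect = nearest_j_and_defect(peak)
    print(j, defect)
    print(shift_assignment(peak, j, +1))
    print("2B(1 - zeta) ~", 2 * B_GROUND * (1 - ZETA))
```

Each allowed move keeps the measured frequency fixed and changes only the (J, defect) pair, so the peak slides along a line of slope close to 2B(1 − ζ) in the defect plot.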

By carefully rearranging individual defects 
according to these discrete allowed “moves,” 
we unwrapped five distinct regions that ex- 
hibit continuous-looking patterns (Fig. 3B). 


Because of the discontinuities at J ~ 80, 110, 160, and 220, there was still some ambiguity in the overall shift of the four perturbed sections labeled (i) to (iv). To remove this ambiguity, we recognized each section’s dominant pure-rank tensor character as follows: (i) −T^[6]; (ii) T^[10]; (iii) T^[6]; (iv) −T^[6]. Eigenvalue spectra in Fig. 1, B, D, and F, show that the extremal eigenvalues associated with C_n rotational symmetry axes occur when J is an integer multiple of n. The J-assignment depicted in Fig. 3B satisfies this condition for all sections simultaneously. This final J-assignment was confirmed by the excellent agreement between J-resolved peak counts and calculated nuclear spin weight far from the discontinuities (Fig. 3C).


Rotational ergodicity transitions 


The J-dependent tensor energy defects imply rovibrational dynamics of C₆₀. To infer these dynamics, we parameterized each of the four perturbed regions (i) to (iv) in Fig. 3B in terms of a mixed tensor (Eq. 5), J-dependent scaling b(J), and J-dependent scalar offset a(J):

$$a(J) + b(J) \times \epsilon(\nu; J, K) \tag{7}$$


where ε(ν; J, K) are the J-dependent tensor splittings as plotted in the lower panels of Fig. 1, B to F.

First, a(J) was obtained from the observed mean defect of each section. Next, by performing a point-cloud registration (50-52) between the theoretical eigenvalue spectra and the measured defect plot, we assigned a best-fit mixing angle ν to each section: (i) π; (ii) 0.45π; (iii) 1.9π; and (iv) π (38). Finally, b(J) was obtained from a least-squares fit to a polynomial in J (38). The resulting reconstructed defect plot for regions (i) to (iv) is shown in Fig. 3D.

The abrupt changes in mixing angles ν are associated with transitions between ergodic and nonergodic rotational dynamics as the molecule “spins up” to higher J. These dynamics
Fig. 3. Obtaining P-branch tensor energy defects versus J. (A) Plot of experimental energy defects from nearest-neighbor J assignment. Peaks are assigned to nearest J according to rigid rotor model (gray vertical grid lines in Fig. 2, B to F). Red defect points (J > 247) correspond to peaks in the absorption spectrum that are no longer individually resolved. Their peak centers are obtained from fitting to a cluster of Voigt lineshapes (38). (Inset) Unwrapping procedure follows a series of allowed “moves” that simultaneously translate points in vertical steps of ±2B(1 − ζ) and horizontal steps of ±1. (B) “Unwrapped” defect plot. Four avoided crossings with varying strengths are seen at J ~ 80, 110, 160, and 220. Defect patterns resemble eigenvalue spectra of icosahedral invariant tensors. (C) Calculated nuclear spin weights overlaid with measured peak counts. Blue highlighted sections show excellent agreement between calculated nuclear spin weights of C₆₀ and assigned peak counts. (D) Reconstruction of perturbed sections of P-branch spectrum based on eigenvalue spectra of mixed tensor operator H_tensor^(6+10)(ν). Four perturbed portions of (B) are reproduced with four mixing angles ν.
transition from ergodic for region (i), to
nonergodic for region (ii), and back again for 
regions (iii) and (iv). 

The origin of this ergodicity breaking can be understood semi-classically using the RESs in Fig. 1, B, D, and F. The dynamics of vector J are ergodic when time evolution explores the full space of symmetry-allowed states at the same energy. For region (iii), the dominant tensor character is T^[6]. Naively, the existence of 12 disconnected trajectories encircling the C₅ axes breaks ergodicity. However, these trajectories cannot be distinguished in C₆₀: Owing to the indistinguishability of the ¹²C nuclei, all 12 semi-classical trajectories correspond to a single quantum state given by their fully symmetric superposition. As such, the vector J dynamics do explore the full range of states at a given energy, and hence are ergodic. The same argument applies for regions (i) and (iv), which differ from (iii) only in the sign of tensor energy defects (Fig. 1F).

By contrast, for region (ii), the dominant tensor character is T^[10]. The C₅ and C₃ axes both correspond to peaks on the RES and host trajectories over the same range of energies (Fig. 3D). Trajectories encircling the C₅ and C₃ axes are distinguishable, and hence correspond to distinct quantum states. The quantum tunneling between these trajectories is unable to restore the ergodicity: The tunneling integral between C₅ and C₃ is exponentially small in J (53, 54), whereas the level spacing only scales as 1/J. The scaling of the tunneling integral follows from standard Wentzel-Kramers-Brillouin (WKB) results (55), and the scaling of level spacings can be seen by comparing the relatively fixed bandwidth of tensor energy defects (Fig. 3B) with the nuclear spin weight ~(2J + 1)/60. Consequently, for our measured J (well within the large-J limit), tunneling corrections are typically only perturbative.

Energy-level statistics provide a simple probe 
of this ergodicity breaking in molecular spec- 
tra (56, 57). Quantum ergodicity is associated 
with eigenstates extended in phase space that 
can be strongly coupled by local perturbations, 
inducing energy-level repulsion. By contrast, 
ergodicity breaking is associated with the ex- 


istence of localized eigenstates, which are not 
strongly coupled by perturbations and whose 
energies are uncorrelated (58). Ergodic and 
nonergodic dynamics are therefore respec- 
tively associated with level repulsion and its 
absence (59). A useful diagnostic tool is the distribution p(r̃), where r̃ is the ratio of consecutive level spacings s_i (60, 61):

$$\tilde{r}_i = \min\left(\frac{s_i}{s_{i+1}}, \frac{s_{i+1}}{s_i}\right) \tag{8}$$

$$s_i = \epsilon_{i+1} - \epsilon_i \tag{9}$$


In the limit of r̃ → 0, level repulsion in an ergodic system causes p(r̃) → 0, and for a nonergodic system p(r̃) → constant. Similar energy-level statistics have been used to analyze systems
as diverse as nuclear spectra (59), ultracold 
atomic scattering (62), and many-body local- 
ization (63). 
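
The gap-ratio diagnostic of Eqs. 8 and 9 can be illustrated with a minimal Python sketch. It does not use the measured defect data: the two spectra below are synthetic stand-ins (uniformly random levels for the uncorrelated case, eigenvalues of a random real symmetric matrix for the level-repelling case), and the function names are hypothetical.

```python
import numpy as np

def gap_ratios(levels):
    """Gap ratios r_i = min(s_i/s_{i+1}, s_{i+1}/s_i) of a set of levels (Eq. 8)."""
    eps = np.sort(np.asarray(levels, dtype=float))
    s = np.diff(eps)                      # s_i = eps_{i+1} - eps_i (Eq. 9)
    small = np.minimum(s[:-1], s[1:])
    large = np.maximum(s[:-1], s[1:])
    # min(a/b, b/a) equals min(a, b)/max(a, b) for non-negative spacings;
    # an exactly degenerate triple gives 0/0 and is returned as NaN.
    with np.errstate(invalid="ignore", divide="ignore"):
        return small / large

def demo_levels(n=400, seed=0):
    """Synthetic (not measured) spectra: uncorrelated vs. level-repelling."""
    rng = np.random.default_rng(seed)
    uncorrelated = rng.uniform(0.0, 1.0, size=n)       # no level repulsion
    a = rng.normal(size=(n, n))
    repelling = np.linalg.eigvalsh((a + a.T) / 2.0)    # random symmetric matrix
    return uncorrelated, repelling

uncorrelated, repelling = demo_levels()
mean_r_uncorrelated = np.nanmean(gap_ratios(uncorrelated))
mean_r_repelling = np.nanmean(gap_ratios(repelling))
print(f"mean gap ratio, uncorrelated levels: {mean_r_uncorrelated:.3f}")
print(f"mean gap ratio, repelling levels:    {mean_r_repelling:.3f}")
```

For large samples the mean gap ratio is expected to sit near 0.39 for uncorrelated levels and near 0.53 for the random-matrix case; a histogram of `gap_ratios(...)` shows the same contrast as p(r̃) near r̃ → 0.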

Figure 4, A to C, show the energy-gap ratios 
computed from sections (i) to (iii), respec- 
tively, of the experimental defect plot (Fig. 3B). 
Here, sections (i) and (iii) exhibit persistent 
level repulsion characteristic of ergodicity, 
whereas section (ii) does not, indicating nonergodicity. We aggregated the energy-gap ratios over each of sections (i) to (iii) and plotted their respective distributions in Fig. 4, D to F. These
distributions confirm the presence of level re- 
pulsion in sections (i) and (iii) and its absence 
in section (ii). 

The physical origin of rovibrational tensor energy defects in C₆₀ can be inferred from Fig. 3B. The defects arise from rovibrational coupling between the bright P-type excited T₁ᵤ⁽⁺⁾(3) state and a background of perturbing dark states. Specifically, both the discontinuities and excess observed peaks at specific J values are characteristic of avoided crossings with perturbing zero-order dark vibrational combination states. As they cross T₁ᵤ⁽⁺⁾(3) from below, rovibrational coupling lifts the degeneracy of J multiplets in the T₁ᵤ⁽⁺⁾(3) state, imparting tensor character that depends on the identity of the perturbing vibrational state. The tensor character of the T₁ᵤ⁽⁺⁾(3) state (specifically the fitted ν values) is stable in between avoided crossings, suggesting that each of these sections is dominantly affected by just one perturbing state at a time. At the avoided crossings, this assumption breaks down, as made particularly evident by the rapid change in mixing angle just before and after the avoided crossing at J = 160 of Fig. 3B. There is no apparent structure to the changes in mixing angle and coupling strength induced by the different avoided crossings, suggesting that the perturbing dark states are distinct and not part of the same Coriolis manifold. Finally, using the observed local density of perturbing states ρ ≈ 2/cm⁻¹ and average measured vibrational coupling strength (bandwidth of the avoided crossings) of β_avg ≈ 2 × 10⁻² cm⁻¹ (38), we arrive at an IVR threshold parameter (10) T(E) = √(2π/3) ρ β_avg ≈ 0.06 < 1. Our C₆₀ rotational ergodicity transitions are observed well below the IVR threshold, a conclusion supported by the spectroscopically well-resolved T₁ᵤ(3) band. This further highlights the fact that, although the rotational ergodicity transition relies on intramolecular vibrational coupling, it is completely distinct from IVR.
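
The arithmetic behind the quoted threshold parameter can be checked in a few lines of Python. This is only a sketch: it assumes the expression T(E) = √(2π/3) × (density of perturbing states) × (average coupling strength), with the two input values taken from the paragraph above; the function and argument names are hypothetical.

```python
import math

def ivr_threshold(density_per_wavenumber, coupling_wavenumber):
    """T = sqrt(2*pi/3) * rho * beta, with rho in states per cm^-1 and beta in cm^-1."""
    return math.sqrt(2.0 * math.pi / 3.0) * density_per_wavenumber * coupling_wavenumber

# Values quoted in the text: about 2 states per cm^-1 and about 2e-2 cm^-1.
T = ivr_threshold(2.0, 2e-2)
print(f"T = {T:.3f}")        # rounds to the quoted 0.06
print("below IVR threshold:", T < 1.0)
```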


Conclusion 


We have measured and characterized icosahedral
tensor rovibrational coupling in ¹²C₆₀.
Analysis of the spectrum of rovibrational ten- 
sor energy defects revealed that as the mol- 
ecule is spun up to higher J, there is a series 
of transitions in the dynamical behavior of 
J in the fixed body frame. Specifically, vec- 
tor J switches between ergodic and nonergodic 
behavior at particular J values, leaving a char- 
acteristic imprint on the defect-level statistics. 
These ergodicity transitions arise from dark vibrational combination states that cross the T₁ᵤ⁽⁺⁾(3) state at particular J values, transferring their anisotropic character onto the T₁ᵤ⁽⁺⁾(3) state through rovibrational coupling.

Our measurements open the door to a rich hierarchy of emergent behavior in C₆₀ isotopologs, accessible at ever-higher spectral resolution. The small nuclear spin-rotation interaction (for example, in ¹³C-substituted isotopologs of C₆₀) can have a magnified effect owing to the small superfine splittings near RES extrema.
Fig. 4. Energy-level statistics in ergodic and nonergodic regimes. (A to C) (Left panels) Gap ratio r as a function of J calculated from sections (i) to (iv) of the
defect spectrum (Fig. 3B). Gap ratios are only plotted at J values for which the peak counts match the calculated nuclear spin weight (Fig. 3C). (Right panels)
Normalized distribution p(r) of gap ratios aggregated over sections (i) to (iii). Note the change from logarithmic to linear r scale between left and right panels. Sections
(i) and (iii) exhibit level repulsion, a signature of ergodicity, whereas section (ii) does not, indicating nonergodicity.
Such “hyperfine” coupling can lead to spontaneous symmetry breaking in a finite system (31, 64). These insights could prove useful for exploiting the exotic orientation state space of C₆₀ for quantum information processing (65) and for investigating the quantum-to-classical transition of information spreading (66). Ultimately, spectroscopy of C₆₀ isotopologs at ever-higher spectral resolution promises to uncover deeper insights into the emergent dynamics of mesoscopic quantum many-body systems.
Ligand-protected metal nanoclusters as low-loss,
highly polarized emitters for optical waveguides


Xiaojian Wang†, Bing Yin†, Lirong Jiang†, Cui Yang, Ying Liu, Gang Zou, Shuang Chen, Manzhou Zhu*


Photoluminescent molecules and nanomaterials have potential applications as active waveguides, but such a use has often been limited by high optical losses and complex fabrication processes. We explored ligand-protected metal nanoclusters (LPMNCs), which can have strong, stable, and tunable emission, as waveguides. Two alloy LPMNCs, Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ (7 ≤ x ≤ 9), were synthesized and structurally determined. Crystals of both exhibited excellent optical waveguide performance, with optical loss coefficients of 5.26 × 10⁻³ and 7.77 × 10⁻³ decibels per micrometer, respectively, lower than those demonstrated by most inorganic, organic, and hybrid materials. The crystal packing and molecular orientation of the Pt₁Ag₁₈ compound led to an extremely high polarization ratio of 0.91. Aggregation enhanced the quantum yields of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ LPMNCs by 115- and 1.5-fold, respectively. This photonic cluster with low loss and high polarization provides a generalizable and versatile platform for active waveguides and polarizable materials.


Optical waveguide systems can be made of passive elements based on refractive index changes, as in an optical fiber, or of active elements that create gain or introduce nonlinear optical effects for signal amplification (1-3). For molecular active optical waveguides, dipole orientations affect the direction of photon transmission (4-9). Reported active waveguide materials, such as organic chromophores, hybrid materials, and polymer materials, have had drawbacks such as high loss, complex synthesis, and low yield (6, 8-15). Ligand-protected metal nanoclusters (LPMNCs), which are composed of a few to hundreds of metal atoms and surface ligands (16-21), are suitable for optical waveguide materials (16, 22). They have strong, stable, and tunable emission, good photostability, a large Stokes shift, and high quantum yields (QYs) (23, 24) and can be synthesized in high purity and yield. For waveguide applications, aggregation-induced emission enhancement could further enhance the photoluminescence (PL) of LPMNCs (25, 26).

We report the optical waveguide performance of two LPMNCs, [Pt₁Ag₁₈(S-Adm)₂(DPPP)₆Cl₆](SbF₆)(AgCl₂) (hereafter referred to as Pt₁Ag₁₈, where S-Adm is adamantane mercaptan and DPPP is 1,3-bis-diphenylphosphine propane)
Engineering of Advanced Materials, Anhui Province Key
and Engineering, University of Science and Technology of
and [AuₓAg₁₉₋ₓ(S-Adm)₂(DPPP)₆Cl₆](ClO₄)₃ (hereafter referred to as AuₓAg₁₉₋ₓ, where 7 ≤ x ≤ 9), both of which had rod-like structures and strong emission properties. The emission spectra of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ were orange and red and were centered at 600 and 770 nm, respectively. One-dimensional (1D) microrod crystals of these LPMNCs exhibited excellent optical waveguide performance, with low optical loss coefficients and distinct polarized waveguide performance. The polarization ratios of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ were 0.91 and 0.17, respectively, which we attributed to the differences in their crystal structure and packing modes. Both LPMNCs displayed aggregation-induced emission enhancement in that the QYs of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ in the solid state were 115 and 1.5 times higher than those in solution, respectively. We propose that Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ, along with other LPMNCs, could find applications in optoelectronic devices.


Crystal structure of Pt₁Ag₁₈


We prepared Pt₁Ag₁₈ ligand-protected nanoclusters (NCs) by reacting Pt₁Ag₂₈ NCs (27) with Ag₂(DPPP)Cl₂ complexes in dichloromethane (see the supplementary materials), and their structure was determined to be [Pt₁Ag₁₈(S-Adm)₂(DPPP)₆Cl₆](SbF₆)(AgCl₂) by single-crystal x-ray diffraction (SCXRD) (table S1 and Fig. 1). The Pt₁Ag₁₈ structure consisted of a kernel composed of a central Pt atom and 12 Ag atoms (Fig. 1A) surrounded by a shell composed of six DPPP ligands (Fig. 1C) and two crown-like Ag₃Cl₃(S-Adm)₁ staple motifs (Fig. 1B). The two Ag₃Cl₃(S-Adm)₁ motifs of the surface shell were distributed on each side of the Pt₁Ag₁₂ kernel, and each motif was connected to the Ag atoms in the kernel through three Ag-Cl bonds with a bond length of 2.44 Å, forming a rod-like Pt₁Ag₁₈Cl₆(DPPP)₆(S-Adm)₂
DPPP ligands bonded to the Ag atoms in the crown-like motif and the Pt₁Ag₁₂ kernel in a form parallel to the Pt₁Ag₁₈Cl₆(S-Adm)₂ rod (Fig. 1C), leading to the total structure of [Pt₁Ag₁₈(S-Adm)₂(DPPP)₆Cl₆]²⁺ (Fig. 1H).

Each crystal cell contained two Pt₁Ag₁₈ molecules with the counterions SbF₆⁻ and AgCl₂⁻ (Fig. 1M); one was at the apex of the unit cell (Pt₁Ag₁₈-1, yellow highlights in Fig. 1M), and the other was in the center (Pt₁Ag₁₈-2, pink highlights in Fig. 1M). An important difference between the two Pt₁Ag₁₈ clusters in each unit cell was the pattern of their associated DPPP benzene rings (Fig. 1H), which resulted in differing intermolecular and intramolecular noncovalent interactions. In Pt₁Ag₁₈-1, π···π interactions within each DPPP ligand and between adjacent DPPP ligands were observed (Fig. 1D), whereas in Pt₁Ag₁₈-2, π···π interactions were only observed between two adjacent DPPP ligands (Fig. 1F). The bonding patterns of the three DPPP methylene groups in Pt₁Ag₁₈-1 (Fig. 1E) and Pt₁Ag₁₈-2 were also different and led to differences in noncovalent interactions. The C-H bonds of the DPPP methylene groups interacted with the benzene π electrons, forming C-H···π secondary bonds for Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2 (Fig. 1, E and G, respectively).

In layered Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2, π···π intermolecular interactions were evident between the top and bottom benzene rings of the DPPP ligands (Fig. 1I), and the C-H bonds of the benzene rings in Pt₁Ag₁₈-2 interacted with π electrons in Pt₁Ag₁₈-1 (Fig. 1J), which together induced C-H···π···π intermolecular interactions (Fig. 1K). Such interactions increased the rigidity and stability of Pt₁Ag₁₈, facilitating its crystallization, and were similar to the interactions associated with the self-assembly of organic molecules. From the [001] crystallographic direction, Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2 alternated to form an orthohexagonal pattern (Fig. 1L, left). From the [100] crystallographic direction, layered stacking of the Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2 orthohexagonal structures can be observed (Fig. 1L). Thus, the differing intermolecular and intramolecular noncovalent interactions of the benzene rings affected the crystallization and self-assembly of the LPMNCs.


Crystal structure of AuₓAg₁₉₋ₓ


AuₓAg₁₉₋ₓ NCs were prepared in a one-pot synthesis. A mixture of DPPP and HS-Adm was added to a solution of AgNO₃ in ethanol, and then an aqueous solution of HAuCl₄ was introduced, followed by an aqueous solution of NaBH₄ (see the supplementary materials). Their structure was determined to be [AuₓAg₁₉₋ₓ(S-Adm)₂(DPPP)₆Cl₆](ClO₄)₃ by single-crystal x-ray crystallography (table S2 and Fig. 2). The AuₓAg₁₉₋ₓ structure consisted
Fig. 1. Crystal structure of Pt₁Ag₁₈. (A to C) Pt₁Ag₁₂ kernel (A), Ag₃Cl₃S motif (B), and structure (C) of the Pt₁Ag₁₈Cl₆(DPPP)₆(S-Adm)₂ rod. (D) Intramolecular π···π interaction in Pt₁Ag₁₈-1. (E) Intramolecular C-H···π interaction in Pt₁Ag₁₈-1. (F) Intramolecular π···π interaction in Pt₁Ag₁₈-2. (G) Intramolecular C-H···π interaction in Pt₁Ag₁₈-2. (H) Overall structure of Pt₁Ag₁₈-1 [left, yellow highlights in (L) and (M)] and Pt₁Ag₁₈-2 [right, pink highlights in (L) and (M)]. (I) Intermolecular π···π interaction between


of an Au₇Ag₆ kernel and an (AuAg)₆Cl₆(S-Adm)₂(DPPP)₆ shell (Fig. 2C). In the Au₇Ag₆ kernel, Ag₃, Au₇, and Ag₃ were stacked sequentially in layers (Fig. 2A). The shell was composed of two crown-like M₃Cl₃(S-Adm)₁ staple motifs (Fig. 2B, where M is a metal, i.e., Au or Ag) and six DPPP ligands (Fig. 2C). The three metal sites in each of the two M₃Cl₃(S-Adm)₁ staple motifs were occupied by either Au or Ag. The two M₃Cl₃(S-Adm)₁ motifs were connected to the Ag atoms on the Au₇Ag₆ kernel through three M-Cl bonds with a bond length of 2.36 Å. Six DPPP ligands bonded to the metal atoms in the staple motifs and the exterior Au atoms of the Au₇Ag₆ kernel through their P atoms. The P-Au-S bond angle was larger than the P-Ag-S bond angle; the average angles were 173.56° and 144.20°, respectively. The resulting total structure of [AuₓAg₁₉₋ₓ(S-Adm)₂(DPPP)₆Cl₆]³⁺ is shown in Fig. 2D.

Each crystal cell contained two AuₓAg₁₉₋ₓ molecules with six ClO₄⁻ counterions (Fig. 2N). Unlike Pt₁Ag₁₈, the patterns of the associated benzene rings of the two AuₓAg₁₉₋ₓ molecules in each unit cell were the same (Fig. 2D). However, the orientations of the two AuₓAg₁₉₋ₓ NCs were different. In a single molecule of AuₓAg₁₉₋ₓ, π···π interactions in a single DPPP ligand and between two DPPP ligands were observed (Fig. 2, E to G). The C-H bonds from the DPPP methylene groups also interacted with the benzene π electrons, forming a C-H···π interaction between two ligands (Fig. 2, H to J). The C-H bonds of the benzene rings in each AuₓAg₁₉₋ₓ NC formed either C-H···π, C-H···π···H-C, or C-H···π···π interactions with the π electrons in the other adjacent AuₓAg₁₉₋ₓ NCs (Fig. 2, K and L). Such interactions facilitated crystallization. The different metal atoms and different intermolecular and intramolecular interactions of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ led to different packing patterns. AuₓAg₁₉₋ₓ formed a zigzag pattern along the [100] crystallographic direction (Fig. 2M).
Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2. (J) Intermolecular C-H···π interaction between Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2. (K) Interaction between Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2. (L) Orthohexagonal packing pattern for Pt₁Ag₁₈-1 and Pt₁Ag₁₈-2. (M) A unit cell of Pt₁Ag₁₈. C and H atoms have been omitted for clarity in (A) to (C). H atoms have been omitted for clarity in (H). C, H, and P atoms have been omitted for clarity in the right side of (M). The intramolecular and intermolecular interactions are indicated by red dotted lines in (D) to (G) and (I) to (K).


Characterization of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ
The Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs were further characterized by ultraviolet-visible (UV-vis) spectroscopy, electrospray ionization mass spectrometry (ESI-MS), x-ray photoelectron spectroscopy, and thermogravimetric analysis. The UV-vis absorption spectrum of Pt₁Ag₁₈ (fig. S1) includes peaks centered at ~330, 365, and 490 nm (fig. S1A); the energy difference between highest-occupied and lowest-unoccupied molecular orbitals (i.e., the HOMO-LUMO gap) was estimated to be 1.92 eV (fig. S1B). The UV-vis absorption spectrum of AuₓAg₁₉₋ₓ had peaks centered at ~353, 427, 483, and 545 nm (fig. S2A), corresponding to a HOMO-LUMO gap of 1.97 eV (fig. S2B). The multiple absorption peaks revealed the quantum size effect of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs, which is different from the surface plasmon resonance of nanoparticles.

The ESI-MS spectrum of Pt₁Ag₁₈ had a mass-to-charge (m/z) peak centered at 2579.69 (fig. S3), which corresponded to [Pt₁Ag₁₈(DPPP)₆
Fig. 2. Crystal structure of AuₓAg₁₉₋ₓ. (A to C) Au₇Ag₆ kernel (A), M₃Cl₃(S-Adm)₁ motifs (B), and structure (C) of AuₓAg₁₉₋ₓCl₆(DPPP)₆(S-Adm)₂. (D) Overall structure of AuₓAg₁₉₋ₓ. (E to G) Intramolecular π···π interaction in AuₓAg₁₉₋ₓ. (H to J) Intramolecular C-H···π interaction between two DPPP ligands. (K and L) Intermolecular C-H···π, C-H···π···H-C, and C-H···π···π interaction in


(S-Adm)₂Cl₆]²⁺ (calculated formula weight: 2579.58). The ESI-MS spectrum of AuₓAg₁₉₋ₓ showed a series of six peaks corresponding to [AuₓAg₁₉₋ₓ(S-Adm)₂(DPPP)₆Cl₆(ClO₄)₁]²⁺ and [AuₓAg₁₉₋ₓ(S-Adm)₂(DPPP)₆Cl₆]³⁺, with 7 ≤ x ≤ 9 (fig. S4). These data were consistent with the results obtained by SCXRD. X-ray photoelectron spectroscopy of Pt₁Ag₁₈ (figs. S5 and S6) and AuₓAg₁₉₋ₓ (figs. S7 and S8) confirmed the presence of all of the expected elements. Thermogravimetric analysis of Pt₁Ag₁₈ showed a total weight loss of 54.7 wt % (fig. S9A), which is consistent with the theoretical loss (53.96 wt %) calculated based on the formula determined from x-ray crystallography. Similarly, the total weight loss of 49.64% is consistent with the theoretical value for [Au₁₀.₁Ag₈.₉(S-Adm)₂(DPPP)₆Cl₆](ClO₄)₃ (fig. S9B).
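
The calculated m/z quoted for the Pt₁Ag₁₈ dication can be cross-checked with a short Python sketch. Everything beyond the cluster formula is an assumption: rounded standard atomic weights are used (not isotope-resolved masses), S-Adm is taken as adamantanethiolate C₁₀H₁₅S, DPPP as C₂₇H₂₆P₂, and the electron mass is ignored; the helper names are hypothetical.

```python
# Rounded standard atomic weights in g/mol (assumed values for this sketch).
ATOMIC_WEIGHT = {"H": 1.008, "C": 12.011, "P": 30.974, "S": 32.06,
                 "Cl": 35.45, "Ag": 107.868, "Pt": 195.084}

# Assumed ligand compositions (not spelled out in the text).
S_ADM = {"C": 10, "H": 15, "S": 1}   # adamantanethiolate
DPPP = {"C": 27, "H": 26, "P": 2}    # diphosphine ligand

def mass(composition):
    """Average mass of a composition given as {element: count}."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in composition.items())

def cluster_mz(metals, n_sadm, n_dppp, n_cl, charge):
    """Average m/z of [metals (S-Adm)n (DPPP)m Cl_k]^(charge+)."""
    total = (mass(metals) + n_sadm * mass(S_ADM)
             + n_dppp * mass(DPPP) + n_cl * ATOMIC_WEIGHT["Cl"])
    return total / charge

# [Pt1Ag18(S-Adm)2(DPPP)6Cl6]2+
mz_pt1ag18 = cluster_mz({"Pt": 1, "Ag": 18}, n_sadm=2, n_dppp=6, n_cl=6, charge=2)
print(f"estimated m/z: {mz_pt1ag18:.2f}")
```

With these rounded weights the estimate lands within a few tenths of the quoted 2579.58, which supports the (S-Adm)₂(DPPP)₆Cl₆ ligand count and the 2+ charge.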


Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ microrods as
optical waveguides


We investigated the photonic properties of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs through optical waveguide measurements. Both Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs readily formed defect-free single crystals with smooth surfaces (fig. S10) through multiple noncovalent interactions within their structures and their high synthesis purities, thereby ensuring high PL efficiency and minimal light scattering from the domain boundary. Fluorescence microscopy was applied to hexagonal sheets of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ to monitor their PL. Under unfocused irradiation, the edges of the Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ crystals were brighter than the center, indicating their optical waveguide behavior (Fig. 3, B and C).

Next, spatially resolved PL imaging was performed on 1D microrod crystals of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ to evaluate the optical waveguide efficiency (Fig. 3A) by moving a focused excitation laser beam (450 nm) to different local positions along the microrod crystal body. Photons propagated predominantly in both directions along the axis of the 1D microrod crystals (Fig. 3, D and E). The emission intensity at the tip of the microrod decreased with increasing propagation distance, but the spectral intensity and profiles of the excited points were constant.

To quantify the optical loss of the microrod crystals, we calculated the optical loss coefficient (R) by single-exponential fitting: I_tip/I_body = A exp(−RD), where I_tip and I_body are the intensities at the emitted tip and the excited site, respectively, and D is the distance between the excited site and the emitted tip (Fig. 3, D and E). In the Pt₁Ag₁₈ microrod, R was calculated to be 5.26 × 10⁻³ dB µm⁻¹, and
AuₓAg₁₉₋ₓ. (M) 1D packing pattern for AuₓAg₁₉₋ₓ. (N) A unit cell of AuₓAg₁₉₋ₓ. C and H atoms have been omitted for clarity in (A) to (C). H atoms have been omitted for clarity in (D). C, H, and P atoms have been omitted for clarity in (N). The intramolecular and intermolecular interactions are indicated by red dotted lines in (E) to (L).


in the AuₓAg₁₉₋ₓ microrod, it was calculated to be 7.77 × 10⁻³ dB µm⁻¹. These values exhibited high consistency across different microrods from the same and different batches (figs. S11 to S13), indicating good repeatability. These values are lower than those previously reported for inorganic (12), organic (1, 6, 8, 10, 11, 15), and hybrid materials (9, 28), the R values of which range from ~10 × 10⁻³ to 60 × 10⁻³ dB µm⁻¹, and are indicative of high optical waveguide efficiency. Intramolecular interactions of the NCs inhibited nonradiative transitions. Intermolecular interactions, which result in denser crystal packing, high levels of crystallinity, and smooth surfaces, helped to diminish emission losses by scattering.
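
The single-exponential fit I_tip/I_body = A exp(−RD) described above can be sketched in Python as a straight-line fit to the logarithm of the intensity ratio. The data below are synthetic, generated from an assumed A = 0.8 and the quoted value R = 5.26 × 10⁻³ per micrometer, purely to show that the fit recovers its inputs; they are not the measured points. The fitted R is in the units implied by exp(−RD); the text quotes R in dB µm⁻¹ without stating a conversion, so none is applied here.

```python
import numpy as np

def fit_loss_coefficient(distance_um, intensity_ratio):
    """Fit I_tip/I_body = A * exp(-R * D) by linear regression on ln(ratio)."""
    d = np.asarray(distance_um, dtype=float)
    log_ratio = np.log(np.asarray(intensity_ratio, dtype=float))
    slope, intercept = np.polyfit(d, log_ratio, 1)   # ln(ratio) = ln(A) - R * D
    return -slope, float(np.exp(intercept))

# Synthetic example: eight excitation positions along a microrod (assumed values).
distance_um = np.array([20.0, 40.0, 60.0, 80.0, 100.0, 120.0, 140.0, 160.0])
intensity_ratio = 0.8 * np.exp(-5.26e-3 * distance_um)

R_fit, A_fit = fit_loss_coefficient(distance_um, intensity_ratio)
print(f"R = {R_fit:.5f} per micrometer, A = {A_fit:.3f}")
```

A log-linear fit is used only for brevity; with noisy intensities a nonlinear least-squares fit of the exponential itself would weight the points differently.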

The optical waveguides of the microrods exhibited stability, with the AuₓAg₁₉₋ₓ microrod displaying higher stability than the Pt₁Ag₁₈ microrod at a temperature of 50°C in an air environment (figs. S14 to S16). The observed increase in optical loss over time could be attributed to the evaporation of solvent molecules within the crystals under high-temperature and oxygen-rich conditions. Furthermore, the microrods composed of NCs demonstrated excellent stability when subjected to various solvents. As shown in figs. S17 to S19, the AuₓAg₁₉₋ₓ microrod displayed a reduced loss coefficient when treated with toluene, p-xylene,
Fig. 3. Optical waveguide of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ. (A) Experimental setup for the optical waveguide equipment. (B and C) Photographs of hexagonal sheets of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ under visible light (left) and fluorescence microscopy images under unfocused 405 nm irradiation (right). (D and E) PL images of microrod crystals of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ excited with a 450 nm laser focused at different positions.


and c-pentane, whereas the Pt₁Ag₁₈ microrod exhibited a slight increase in the loss coefficient when treated with p-xylene, c-pentane, and t-butyl methyl ether. Solvents may affect the weak interactions within the microrods, thereby influencing the optical waveguide performance of NCs.

We observed that NC crystals in bent and 
branched states exhibited significant optical 
waveguide effects (figs. S20 and S21). The for- 
mation of bent crystals can be attributed to the 
flexibility of surface organic ligands and the 
intermolecular interactions of NCs. The wave- 
guides formed by NCs in bent and branched 
states provide opportunities for photonic prop- 
agation within miniaturized complex structures. 
Additionally, both NC 2D crystal microsheets 
exhibited optical waveguiding, and we further 
investigated their waveguide directionality. As 
shown in figs. S22 and S23, the luminescence
intensities of each side of the microsheets 
were virtually identical when excited in the 
center. When one side of the crystal was ex- 
cited, the other sides exhibited similar emission 
intensities. These results indicate the absence 
of anisotropy in the optical waveguide behav- 
ior. However, optical waveguide behavior was 
barely observed in the amorphous powder 
state (fig. S24) and film state (fig. S25) of the 
NCs, which can be attributed to the limited
ability of photons to propagate in the absence 
of highly ordered structures of NCs. 

We studied whether other LPMNCs could exhibit the optical waveguide properties, and found that Au₄Cu₅ (29), Au₁Cu₁₄ (30), and Pt₁Ag₃₇ (31) also exhibited low R, with R = 13.21 × 10⁻³ dB µm⁻¹ for Au₄Cu₅, 15.88 × 10⁻³ dB µm⁻¹ for Au₁Cu₁₄, and 9.92 × 10⁻³ dB µm⁻¹ for Pt₁Ag₃₇ (fig. S26). We further studied the effect of Stokes shifts of all of these LPMNCs on R. The large Stokes shifts of NCs correspond to a narrow overlap in the emission and main absorption spectra and effectively diminished reabsorption of light during propagation along the crystals. However, R did not decrease with the increase of Stokes shifts (fig. S27), which indicated that the Stokes shift was not the only factor affecting the optical loss coefficient of metal NCs. Smooth surfaces and high crystal quality also have important effects on optical waveguide properties (22, 32).


Polarized optical performance of Pt₁Ag₁₈
and AuₓAg₁₉₋ₓ


The optical properties of emitters are directly 
affected by the packing of their constituent 
molecules and the orientation of their optical 
transition dipoles. In particular, packing and 
Numbers 1 to 8 correspond to PL images of excitation along the microrod crystal body from left to right. (F and G) Ratio of the intensities at the tip and body of microrod crystals (I_tip/I_body) of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ versus the distance between the excitation and emission spots on the right side of the images in (D) and (E). The curve was fitted by the exponential decay function I_tip/I_body = A exp(−RD).


orientation lead to distinct PL polarization, an effect that is important for waveguide design, optical connections, and active optical communication device integration. To elucidate the optical anisotropy caused by the orientation of the molecules in the microrods, polarized optical microscopy was performed. By rotating the polarizer to different polarization angles (θ), the linear polarization of the light signal emitted from the tips of microrods of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ could be obtained. We measured θ as the angle between the horizontal direction of the 1D crystal and the polarization direction of the polarizer placed in front of the charge-coupled device detector (Fig. 3A) and recorded the PL intensities of the polarized light signals from the tip at 30° increments (Fig. 4).

Fig. 4. Polarized optical performance of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ. (A and B) Photoemission
intensity of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ, respectively, as a function of the polarizer
rotation angle θ. (C and D) Polarized PL spectra of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ microrods,
respectively, at various angles (~0° to 360°).

The relationship between the maximum luminous intensity and the polarization angle
of the microrod crystals was well fitted with a sin² curve. The Pt₁Ag₁₈ microrod
displayed the highest fluorescence intensities at polarization angles of 8° and 188°
and the lowest at 98° and 278° (Fig. 4A), leading to an emission dichroic ratio
(R_d = I_8°/I_98°) of 21.4 and a polarization ratio [ρ = (R_d − 1)/(R_d + 1)] of 0.91,
which is beneficial for polarized-emission materials. However, AuₓAg₁₉₋ₓ microrods
exhibited weak polarization properties, with a ρ of 0.17, as calculated from their
highest and lowest PL intensities at 174° and 71°, respectively.
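The polarization ratio quoted above follows from the emission dichroic ratio through ρ = (R_d − 1)/(R_d + 1). As a quick arithmetic check, the sketch below evaluates this relation for the reported R_d of 21.4 and inverts it for the reported ρ of 0.17. The function names are hypothetical, and the dichroic ratio obtained for AuₓAg₁₉₋ₓ is derived here by inversion rather than quoted from the source.

```python
def polarization_ratio(dichroic_ratio):
    """rho = (R_d - 1) / (R_d + 1), with R_d = I_max / I_min."""
    return (dichroic_ratio - 1.0) / (dichroic_ratio + 1.0)

def dichroic_ratio_from_polarization(rho):
    """Inverse relation: R_d = (1 + rho) / (1 - rho)."""
    return (1.0 + rho) / (1.0 - rho)

rho_pt = polarization_ratio(21.4)              # reported R_d for the Pt-doped microrod
rd_au = dichroic_ratio_from_polarization(0.17)  # implied R_d for the Au-doped microrod
print(round(rho_pt, 2))  # 0.91
print(round(rd_au, 2))   # 1.41
```

An R_d of about 1.4 means that the brightest and dimmest polarization directions differ by less than a factor of 1.5, which is consistent with the weak anisotropy described for AuₓAg₁₉₋ₓ.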

From observation of the molecular orientations of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ (Figs. 1M and
2M), the angle between the transition dipole moment of the Pt₁Ag₁₈ NCs and their
preferred direction of growth results in highly polarized emission. For AuₓAg₁₉₋ₓ
microrods, however, the weak PL anisotropy detected was attributed to the zigzag
pattern in which the two molecules in each unit cell pack together (Fig. 2M). Thus,
the differences in the polarization properties of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ can be
attributed to the partially anisotropic structure of NC entities and their incomplete
mirror symmetry packing modes (33).


PL properties 

We further explored the emissions of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ as amorphous solids and in
dichloromethane solution by measuring their QY, defined as the ratio of emitted
photons to absorbed photons, and their PL lifetime. The solution of Pt₁Ag₁₈ was weakly
emissive; the


Wang et al., Science 381, 784-790 (2023) 


maximum emission was centered at 600 nm under an excitation of 366 nm (Fig. 5A) with
a QY of 0.25%. By comparison, solid Pt₁Ag₁₈ exhibited an intense orange emission
(Fig. 5B) with a much higher QY of 28.9%. The average PL lifetime of Pt₁Ag₁₈ was also
longer in the solid state: 0.54 μs in solution versus 1.88 μs in the solid state
(fig. S28). This trend held, although with smaller increases, for AuₓAg₁₉₋ₓ. In
solution, AuₓAg₁₉₋ₓ displayed a weak red emission at 770 nm with a QY of 4.66% under
excitation at 467 nm (Fig. 5, A and B), but a slightly stronger emission was observed
in the solid state, with a QY of 6.91%. The average PL lifetime of AuₓAg₁₉₋ₓ similarly
increased slightly in the solid state, from 2.68 μs in solution to 3.17 μs
(fig. S29). The differences in emission between Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ can be
attributed to the alloy effect, which influences the origin of the PL.
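The size of the solid-state enhancement can be read off directly from the quantum yields and lifetimes reported above. The short sketch below computes the solid-to-solution ratios; the dictionary layout and the helper name `fold_change` are illustrative assumptions, and only the numerical inputs come from the text.

```python
# Reported values: (solution, solid). QY in percent, lifetime in microseconds.
quantum_yield = {"Pt1Ag18": (0.25, 28.9), "AuxAg19-x": (4.66, 6.91)}
lifetime_us = {"Pt1Ag18": (0.54, 1.88), "AuxAg19-x": (2.68, 3.17)}

def fold_change(solution_value, solid_value):
    """Ratio of the solid-state value to the solution value."""
    return solid_value / solution_value

qy_fold = {name: fold_change(*pair) for name, pair in quantum_yield.items()}
lifetime_fold = {name: fold_change(*pair) for name, pair in lifetime_us.items()}

for name in quantum_yield:
    print(f"{name}: QY x{qy_fold[name]:.1f}, lifetime x{lifetime_fold[name]:.2f}")
```

The QY ratios come out near 116 and 1.5, so the contrast between the two NCs is far larger in quantum yield than in lifetime.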

This difference in the emissions of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ in different states led us
to further explore their aggregation-induced emission enhancement (AIEE)


18 August 2023


performance. We
analyzed solutions of each material in mixtures of acetonitrile (MeCN) and H₂O,
selected for their ability to dissolve the two NCs readily and poorly, respectively.
The PL intensity of Pt₁Ag₁₈ was enhanced 30-fold as the proportion of water in the
binary solvent mixture increased, reaching a maximum at a 4:6 volume ratio of
MeCN:H₂O (Fig. 5C). Increasing the proportion of water beyond 60% decreased the
emission, but the PL was still much higher than that in pure MeCN. Similar results
were observed previously in other AIEE-active materials (34).

Dynamic light scattering was used to evaluate the aggregation degree and size
distribution of Pt₁Ag₁₈, showing that the aggregation of the NCs was highest at 4:6
MeCN:H₂O, with an average aggregate diameter of 342 nm (range 220 to 531 nm)
(fig. S30), thus confirming the relationship between increased aggregation and
increased PL emission. Similar results were observed in solutions of AuₓAg₁₉₋ₓ, with a
fivefold enhancement in the PL intensity of strong red emissions at a MeCN:H₂O volume
ratio of 3:7 (Fig. 5D) caused by the maximum degree of aggregation (fig. S31).

To evaluate the effect of aggregation on absorption, UV-vis absorption spectra of
Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ in various mixed solvent systems were collected. As shown in
fig. S32A, neither a shift in absorption peak nor a change in relative intensity was
observed across 10 different solvent mixtures, suggesting that the aggregation of
Pt₁Ag₁₈ is not associated with decomposition or structural changes. As shown in
fig. S32B, the UV-vis absorption spectra of AuₓAg₁₉₋ₓ in 10 mixed solvent systems also
showed no absorption peak shift or relative intensity change. Both Pt₁Ag₁₈ and
AuₓAg₁₉₋ₓ NCs exhibited AIEE performance without decomposition or structural changes
to the ligand-protected NCs, although the PL enhancement of Pt₁Ag₁₈ was more
pronounced.


Conclusions 


Here, we synthesized and characterized ligand-protected Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs and
studied their optical waveguide properties. Both NCs consist of an icosahedral M₁₃
core, two crown-like M₃Cl₃(SR)₃ staple motifs, and six DPPP ligands. Crystals of the
Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs exhibited excellent optical waveguide performance, with low
optical loss coefficients of 5.26 × 10⁻³ and 7.77 × 10⁻³ dB μm⁻¹, respectively, which
can be attributed to their inhibition of nonradiative transition, dense crystal
packing, and large Stokes shifts. Furthermore, the optical waveguide performance of
other NCs was investigated, which revealed the universality among ligand-protected,
atomically precise NCs. The different crystal structures and packing modes of Pt₁Ag₁₈
and AuₓAg₁₉₋ₓ account for their distinct polarized waveguide performance. Pt₁Ag₁₈
microrods displayed excellent polarizing properties with a polarization ratio of
0.91, whereas AuₓAg₁₉₋ₓ microrods showed weak polarized emission with a polarization
ratio of 0.17. Furthermore, Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ displayed an AIEE effect, and 115-
and 1.5-fold enhancements in QY were observed for Pt₁Ag₁₈ (28.9% versus 0.25%) and
AuₓAg₁₉₋ₓ (6.91% versus 4.66%) on going from solution to the solid state.
Collectively, these results suggest that Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ NCs have potential for
use in optical communication and miniaturized optoelectronic devices.

the PL of Pt₁Ag₁₈ (blue circle) and AuₓAg₁₉₋ₓ (green circle). (C and D) Photoemission
spectra of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ, respectively, in mixed solvents of MeCN and H₂O with
different volume ratios. Insets: digital photos of Pt₁Ag₁₈ and AuₓAg₁₉₋ₓ in mixed
solvents with different volume ratios under 365 nm UV light irradiation.

Universal theory of strange metals from spatially random interactions


Aavishkar A. Patel, Haoyu Guo, Ilya Esterlis, Subir Sachdev


Strange metals, which are ubiquitous in correlated quantum materials, transport
electrical charge at low temperatures but not by the individual electronic
quasiparticle excitations, which carry charge in ordinary metals. In this work, we
consider two-dimensional metals of fermions coupled to quantum critical scalars, the
latter representing order parameters or fractionalized particles. We show that at low
temperatures ($T$), such metals generically exhibit strange metal behavior with a
$T$-linear resistivity arising from spatially random fluctuations in the
fermion-scalar Yukawa couplings about a nonzero spatial average. We also find a
$T\ln(1/T)$ specific heat and a rationale for the Planckian bound on the transport
scattering time. These results are in agreement with observations and are obtained in
the large $N$ expansion of an ensemble of critical metals with $N$ fermion flavors.


A major theme in the study of correlated metals has been their strange metal behavior
at low temperatures, i.e., a linear-in-temperature resistivity smaller than the
quantum unit of resistivity ($h/e^2$ in two dimensions), which appears to be
controlled by a dissipative Planckian relaxation time of order $\hbar/(k_B T)$ (where
$h$ is Planck's constant, $\hbar = h/(2\pi)$, $e$ is the electron charge, $k_B$ is
Boltzmann's constant, and $T$ is the absolute temperature) (1-8). This behavior is in
sharp contrast to the $T^2$ dependence of the resistivity and the $1/T^2$ relaxation
time invariably observed in conventional metals described by Fermi liquid theory.
Moreover, the anomalous resistivity of strange metals is accompanied by a logarithmic
enhancement of the Sommerfeld metallic specific heat to $T\ln(1/T)$ (1) from the
$\sim T$ behavior of conventional metals.
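To give a sense of scale for the Planckian relaxation time $\hbar/(k_B T)$ mentioned above, the sketch below evaluates it at a few temperatures using the SI values of $\hbar$ and $k_B$. The function name `planckian_time` and the chosen temperatures are illustrative assumptions, not values from the paper.

```python
HBAR = 1.054571817e-34  # reduced Planck constant, J s
K_B = 1.380649e-23      # Boltzmann constant, J / K

def planckian_time(temperature_kelvin):
    """Planckian relaxation time hbar / (k_B * T), in seconds."""
    if temperature_kelvin <= 0:
        raise ValueError("temperature must be positive")
    return HBAR / (K_B * temperature_kelvin)

# Illustrative temperatures only.
times = {t: planckian_time(t) for t in (1.0, 10.0, 100.0, 300.0)}
for temperature, tau in times.items():
    print(f"T = {temperature:6.1f} K  ->  tau = {tau:.3e} s")
```

At 100 K this time is of order $10^{-13}$ s, and it scales as $1/T$, in contrast to the $1/T^2$ relaxation time of a Fermi liquid.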

Starting with the seminal work by Hertz 
(9), there has been extensive research on the 
properties of electronic Fermi surfaces at quan- 
tum phase transitions (10). The quantum crit- 
ical fluctuations are represented by a scalar 
field, which is usually a symmetry-breaking 
order parameter but could also be a fraction- 
alized particle at phase transitions without an 
order parameter (11). This scalar field has a 
Yukawa coupling to the electrons, by which 
the electrons scatter by emitting or absorbing 
a scalar field excitation (the Yukawa coupling 
is similar to the electron-phonon coupling but 
without a suppression by the gradient of the 
scalar field). It is now known that such a Fermi 
surface coupled to a quantum critical scalar 
leads to a breakdown of the electronic quasi- 
particle excitations in two spatial dimensions 
(10, 12). But the Fermi surface survives as a 
sharp boundary in momentum space, sepa- 
rating particle- and hole-like excitations, which 
are diffuse in energy space. In the presence of 
random impurities that scatter the electrons 
(13-17), there are cases where the quasi- 
particles are at the boundary of stability, 
which leads to marginal Fermi liquid behavior 
(18) in single-particle observables, 
such as those observed in photoemission 
experiments. 

However, despite these advances, theory has 
so far found limited success in explaining all of 
the defining transport properties (such as the 
linear-in-T resistivity) of strange metals. Con- 
servation of momentum in the low-energy 
theory of a clean metal implies that the dc 
and optical conductivities are not affected by 
the anomalous self-energy of the excitations 
near the Fermi surface (15-17, 19-22). In other 
words, the strong coupling between the Fermi 
surface and the scalar field places the system 
in the limit of strong scalar drag, and this clean 
metal theory cannot describe strange metal 
behavior. This is in contrast to the electron- 
phonon system, where the weak electron- 
phonon coupling makes phonon drag a factor 
only in ultrapure samples (23). Umklapp scat- 
tering can lead to nonzero resistance, and 
its influence in quantum critical metals has 
been investigated in other works (16, 24). 
However, umklapp is suppressed at low T, its 
predictions for transport are not universal 
and depend upon specific Fermi surface details, 
and there is no corresponding $T\ln(1/T)$ 
specific heat. 

Given the ubiquity of strange metal trans- 
port across numerous correlated electron mate- 
rials (from the cuprates and the pnictides to 
recently discovered twisted bilayer graphene), a simple and universal mechanism may
be at play. We propose that spatial disorder in the fermion-scalar Yukawa coupling,
about a nonzero spatial average, provides just such a universal mechanism. Such
disorder is ubiquitous in models of correlated electron materials; for example, in a
Hubbard model with on-site repulsion $U$ and an impurity-induced disorder in the
electron hopping $t_{ij}$, the Schrieffer-Wolff transformation generates disorder in
the exchange interaction $J_{ij} = 4t_{ij}^2/U$, and this disorder then feeds into the
Yukawa coupling after a standard decoupling procedure (9) that introduces the scalar
field. Moreover, our mechanism applies universally across different classes of
quantum critical metals, with scalars that are either fractionalized particles or
order parameters at zero or nonzero momentum, which have distinct critical behaviors
in the clean limit. In the limit of a large number of fermion flavors, $N$, we find a
universal phenomenology that matches observations, including the $T$-linear
resistivity, the Planckian relaxation time, and the $T\ln(1/T)$ specific heat.
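The Hubbard-model example above can be made concrete with a small numerical sketch: random hopping amplitudes $t_{ij}$ are mapped to exchange couplings through $J_{ij} = 4t_{ij}^2/U$, so disorder in the hopping becomes disorder in the interaction around a nonzero mean. The Gaussian distribution, the parameter values, and the function name `exchange_couplings` are assumptions made purely for illustration.

```python
import random
import statistics

def exchange_couplings(t_mean, t_rel_spread, repulsion_u, n_bonds, seed=0):
    """Sample hoppings t_ij and return exchange couplings J_ij = 4 t_ij^2 / U."""
    rng = random.Random(seed)
    hoppings = [rng.gauss(t_mean, t_rel_spread * t_mean) for _ in range(n_bonds)]
    return [4.0 * t * t / repulsion_u for t in hoppings]

# Hypothetical parameters: 2% hopping disorder, U = 8 t.
couplings = exchange_couplings(t_mean=1.0, t_rel_spread=0.02,
                               repulsion_u=8.0, n_bonds=20000)
mean_j = statistics.fmean(couplings)
rel_spread_j = statistics.pstdev(couplings) / mean_j
print(f"mean J = {mean_j:.4f}, relative spread of J = {rel_spread_j:.4f}")
```

Because $J_{ij}$ is quadratic in $t_{ij}$, a small relative spread in the hopping roughly doubles in the exchange coupling, while the spatial average of $J_{ij}$ stays nonzero.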

A key observation of our analysis is that al- 
though the fermion inelastic self-energy correc- 
tions can be dominated by the spatially uniform 
coupling, the transport is nevertheless domi- 
nated by the spatially random coupling, and 
this leads to our main results. Our work follows 
other recent works with random Yukawa inter- 
actions (25-34) inspired by the Sachdev-Ye- 
Kitaev (SYK) model (35, 36) along with studies 
that found linear-in-T resistivity with random 
interactions but with vanishing spatial aver- 
age (32, 34, 37-39). 


Spatially uniform quantum critical metal 


We begin by recalling the SYK-inspired large $N$ theory of the two-dimensional quantum
critical metal (32, 34) for the case where the order parameter has zero momentum. The
imaginary time ($\tau$) action for the fermion field $\psi_i$ and scalar field
$\phi_i$ (with $i = 1 \ldots N$ a flavor label carried by the fermions introduced only
to enable a controlled large $N$ limit) is (34)

$$
\begin{aligned}
S_g = {} & \int d\tau \sum_{k} \sum_{i=1}^{N} \psi^\dagger_{ik}(\tau)\,[\partial_\tau + \varepsilon(k)]\,\psi_{ik}(\tau) \\
& + \frac{1}{2}\int d\tau \sum_{q} \sum_{i=1}^{N} \phi_{iq}(\tau)\,[-\partial_\tau^2 + K q^2 + m_b^2]\,\phi_{i,-q}(\tau) \\
& + \frac{1}{N}\int d\tau\, d^2r \sum_{i,j,l=1}^{N} g_{ijl}\, \psi^\dagger_i(r,\tau)\,\psi_j(r,\tau)\,\phi_l(r,\tau)
\end{aligned}
\tag{1}
$$

where $k$ and $q$ are spatial momenta, the fermion dispersion $\varepsilon(k)$
determines the Fermi surface, the scalar mass $m_b$ must be tuned to criticality and
is needed for infrared regularization but does not appear in final results, and the
Yukawa fermion-scalar coupling $g_{ijl}$ is space independent but random in flavor
space with

$$
\overline{g_{ijl}} = 0, \qquad \overline{g^*_{ijl}\, g_{abc}} = g^2\, \delta_{ia}\delta_{jb}\delta_{lc}
\tag{2}
$$


where the overline represents average over 
flavor space. The hypothesis is that a large 
domain of flavor couplings all flow to the same 
universal low-energy theory (as in the SYK 
model), so we can safely examine the average 
of an ensemble of theories. Momentum is con- 
served in each member of the ensemble, and 
the flavor-space randomness does not lead 
to any essential difference from nonrandom 
theories. This is in contrast to position-space 
randomness, which we consider later and which 
does relax momentum and modify physical 
properties. 

The disorder average of the partition function of $S_g$ leads to a so-called
$G$-$\Sigma$ theory, whose large $N$ saddle point of Eq. 1 has singular fermion
($\Sigma$) and boson ($\Pi$) self-energies at $T = 0$ (where $\omega$ is frequency)
(34)

$$
\Pi(i\omega, q) = -c_b \frac{|\omega|}{|q|}, \qquad
\Sigma(i\omega, \hat{k}) = -i c_f\, \mathrm{sgn}(\omega)\, |\omega|^{2/3},
$$
$$
c_b = \frac{g^2}{2\pi\kappa v_F}, \qquad
c_f = \frac{g^2}{2\pi v_F \sqrt{3}} \left(\frac{2\pi v_F \kappa}{K^2 g^2}\right)^{1/3}
\tag{3}
$$

These results are obtained on a circular Fermi surface with curvature
$\kappa = 1/m$, where $m$ is the effective mass of the fermions. The result is
consistent with the theory of two antipodal patches around $\pm k_0$ on the Fermi
surface to which $q$ is tangent, with axes chosen so that $q = (0, q)$ and fermionic
dispersion $\varepsilon(\pm k_0 + k) = \pm v_F k_x + \kappa k_y^2/2$.

The large $N$ computation of the conductivity (17, 22) yields only the clean Drude
result $\mathrm{Re}[\sigma(\omega)]/N = \pi \mathcal{N} v_F^2 \delta(\omega)/2$, where
$\mathcal{N} = m/(2\pi)$ is the fermion density of states at the Fermi level. This is
in contrast to the results of previous studies (12, 40), in which it has been found
that dc resistivity is $\sim T^{4/3}$ and optical conductivity is
$\sim |\omega|^{-2/3}$.


Potential disorder 
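We now add to the model a spatially random fermion potential

$$
S_v = \frac{1}{\sqrt{N}} \int d^2r\, d\tau \sum_{i,j=1}^{N} v_{ij}(r)\, \psi^\dagger_i(r,\tau)\,\psi_j(r,\tau), \qquad
\overline{v_{ij}(r)} = 0, \qquad
\overline{v^*_{ij}(r)\, v_{lm}(r')} = v^2\, \delta(r - r')\,\delta_{il}\delta_{jm}
\tag{4}
$$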

Here, the overline is an average over both spatial coordinates and flavor space. The
large $N$ limit of the $G$-$\Sigma$ theory (7) yields a saddle point that has
statistical translational invariance and is similar to that found in earlier studies
(13, 15, 16). The low-frequency boson propagator now has the diffusive form
$\sim (q^2 + c_d|\omega|)^{-1}$ with dynamic critical exponent $z = 2$, whereas the
fermion self-energy has an elastic scattering term along with a marginal Fermi liquid
(18) inelastic term at low frequencies

$$
\Pi(i\omega, q) = -\frac{\mathcal{N} g^2 |\omega|}{\Gamma}, \qquad \Gamma = 2\pi v^2 \mathcal{N},
$$
$$
\Sigma(i\omega, \hat{k}) = -\frac{i\Gamma}{2}\,\mathrm{sgn}(\omega)
- \frac{i g^2 \omega}{2\pi^2 \Gamma} \ln\left(\frac{e\,\Gamma^3}{\mathcal{N} g^2 v_F^2 |\omega|}\right)
\tag{5}
$$

at $T = 0$. However, the marginal Fermi liquid self-energy, although leading to a
$T\ln(1/T)$ specific heat, does not (17) lead to the claimed (18) linear-$T$ term in
the dc resistivity because it arises from forward scattering of electrons off the
$q \sim 0$ bosons. These forward-scattering processes are unable to relax either
current or momentum because of the small wavevector of the bosons involved and the
momentum conservation of the $g$ interactions. As a result, even a perturbative
computation of the conductivity at $O(g^2)$ (Fig. 1) shows a cancellation between the
interaction-induced self-energy contributions and the interaction-induced vertex
correction, leading to a dc conductivity that is just a constant set by the elastic
potential disorder scattering rate $\Gamma$. The leading frequency dependence of the
optical conductivity at frequencies $\omega \ll \Gamma$ is just a constant, and there
is no linear-in-frequency correction (17). Correspondingly, in the dc limit, there is
no linear-in-$T$ correction, and a conventional $T^2$ correction is expected.


Interaction disorder 


Our main results are obtained with additional spatially random interactions. In
principle, such terms will be generated under a renormalization analysis from $S_v$.
However, such a renormalization is not part of our large $N$ limit, so we account for
such interactions by adding an explicit term

$$
S_{g'} = \frac{1}{N} \int d^2r\, d\tau \sum_{i,j,l=1}^{N} g'_{ijl}(r)\, \psi^\dagger_i(r,\tau)\,\psi_j(r,\tau)\,\phi_l(r,\tau), \qquad
\overline{g'_{ijl}(r)} = 0, \qquad
\overline{g'^*_{ijl}(r)\, g'_{abc}(r')} = g'^2\, \delta(r - r')\,\delta_{ia}\delta_{jb}\delta_{lc}
\tag{6}
$$

Note, $v$, $g$, and $g'$ are all independent flavor-random variables. Earlier works
have considered the limiting case of $g = 0$, $v = 0$, and $g' \neq 0$ (32, 34). We
will instead describe the more physically relevant regime where spatial disorder is a
weaker perturbation to a clean quantum critical system, with $g$ the largest
interaction coupling. We therefore now have $g$, $v$, and $g'$ all nonzero. The theory
of $S_g + S_v + S_{g'}$ is described in the supplementary materials (41).

As with $S_g + S_v$ above, we find a statistical translational invariance at large
$N$, with a low-frequency boson propagator characterized by $z = 2$ and the
low-frequency fermion self-energy with an elastic scattering term along with a
marginal Fermi liquid inelastic term (42)

$$
\Pi(i\omega, q) = -\frac{\mathcal{N} g^2 |\omega|}{\Gamma} - \frac{\pi}{2}\,\mathcal{N}^2 g'^2 |\omega| \equiv -c_d |\omega|,
$$
$$
\Sigma(i\omega, \hat{k}) = -\frac{i\Gamma}{2}\,\mathrm{sgn}(\omega)
- \frac{i g^2 \omega}{2\pi^2 \Gamma} \ln\left(\frac{e\,\Gamma^2}{v_F^2 c_d |\omega|}\right)
- \frac{i \mathcal{N} g'^2 \omega}{4\pi} \ln\left(\frac{e\,\Lambda_d^2}{c_d |\omega|}\right)
\tag{7}
$$

where $\Lambda_d \sim \Gamma/v_F$. This self-energy leads to a $T\ln(1/T)$ specific
heat, as for the large $g'$ case (34). However, there is now an important difference
with respect to the previous case where $g' = 0$, which leads to markedly different
charge transport properties: The marginal Fermi liquid self-energy now contains a
term (last term in Eq. 7) that does not arise solely from the forward scattering of
electrons. This term is produced by the disordered part of the interactions in Eq. 6.
Thus, this part of the self-energy represents scattering that relaxes both current
and momentum carried by the electron fluid, and therefore its imaginary part on the
real frequency axis determines the inelastic transport scattering rate.

We can show this as follows by computing the conductivity using the Kubo formula. If
we work perturbatively in $g$ and $g'$, then the conductivity at $O(g^2)$ and
$O(g'^2)$ in the large $N$ limit is given by the sum of self-energy contributions and
vertex corrections (Fig. 1). However, owing to the isotropy of the scattering
processes arising from the $g'$ interactions, only the vertex correction caused by
the $g$ interactions survives. The conductivity up to the first subleading
frequency-dependent correction is then given by (41)

$$
\frac{1}{N}\,\mathrm{Re}[\sigma(\omega \gg T)] = \sigma_v + \sigma_{\Sigma,g} + \sigma_{V,g} + \sigma_{\Sigma,g'};
$$
$$
\sigma_v(\omega) = \frac{\mathcal{N} v_F^2}{2\Gamma}, \qquad
\sigma_{\Sigma,g}(\omega) = -\frac{\mathcal{N} v_F^2 g^2 |\omega|}{8\pi\Gamma^3},
$$
$$
\sigma_{V,g}(\omega) = \frac{\mathcal{N} v_F^2 g^2 |\omega|}{8\pi\Gamma^3}, \qquad
\sigma_{\Sigma,g'}(\omega) = -\frac{\mathcal{N}^2 v_F^2 g'^2 |\omega|}{16\,\Gamma^2}
\tag{8}
$$

Note that the $g^2$ vertex and self-energy terms cancel, and we have

$$
\frac{N}{\mathrm{Re}[\sigma(\omega \gg T)]} = \frac{1}{\mathcal{N} v_F^2}\left(2\Gamma + \frac{\mathcal{N} g'^2 |\omega|}{4}\right)
\tag{9}
$$

Fig. 1. Interaction corrections to the conductivity in the large N limit. (A to E)
The current operators are denoted by solid circles, and the wavy lines denote boson
propagators. Dashed lines denote random flavor averaging of the interaction
couplings. The fermion Green's functions (solid lines) include the effects of the
disordered potential (v), and the quantum critical boson propagators include the
effects of damping due to interactions. Vertex corrections [(C) to (E)] contain only
g interactions because the contributions from g' interactions vanish as a result of
the decoupling of the momentum integrals in the loops containing the external current
operators. The sum of the two Aslamazov-Larkin diagrams [(D) and (E)] vanishes exactly
in the limit of large Fermi energy, and the perturbative result (Eq. 9) is therefore
valid to all orders in the interaction strength (41).


The $g'^2$ term does not cancel and leads to a linear-in-frequency correction to the
constant transport scattering rate $\Gamma$. In the opposite limit $|\omega| \ll T$,
this translates into a $T$-linear correction to the resistivity in the dc limit;
computing the coefficient of the linear-$T$ resistivity requires a self-consistent
numerical analysis, which has been carried out in the large $g'$ limit (32, 34).
Notably, the slope of this scattering rate with respect to $|\omega|$ (and therefore
$T$) does not depend on $\Gamma$ and hence on the residual ($\omega = T = 0$)
resistivity. We show (41) that the perturbative result described here continues to be
valid under a full resummation of all diagrams at large $N$ in the Kubo formula
because all surviving higher-order contributions are merely repetitions of the
interaction insertions in Fig. 1, B and C.
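The statement that the slope of the scattering rate is independent of the residual resistivity can be illustrated with a toy calculation. The sketch below assumes a Drude-like inverse conductivity of the form $(2\Gamma + c\,|\omega|)/(\mathcal{N} v_F^2)$, where the coefficient $c$ is a hypothetical stand-in for the $g'^2$-dependent prefactor; all numbers and function names are illustrative and are not taken from the paper.

```python
def inverse_conductivity(omega, gamma_elastic, linear_coeff, dos=1.0, v_fermi=1.0):
    """Toy inverse conductivity: (2*Gamma + c*|omega|) / (N_dos * v_F^2)."""
    return (2.0 * gamma_elastic + linear_coeff * abs(omega)) / (dos * v_fermi ** 2)

def residual(gamma_elastic, linear_coeff):
    """Value at omega = 0, set only by the elastic rate Gamma."""
    return inverse_conductivity(0.0, gamma_elastic, linear_coeff)

def slope(gamma_elastic, linear_coeff, d_omega=1e-3):
    """Finite-difference slope with respect to |omega|."""
    low = inverse_conductivity(0.0, gamma_elastic, linear_coeff)
    high = inverse_conductivity(d_omega, gamma_elastic, linear_coeff)
    return (high - low) / d_omega

# Two hypothetical samples: same interaction disorder, different potential disorder.
clean_slope = slope(gamma_elastic=0.1, linear_coeff=0.25)
dirty_slope = slope(gamma_elastic=1.0, linear_coeff=0.25)
print(clean_slope, dirty_slope)
print(residual(0.1, 0.25), residual(1.0, 0.25))
```

In this toy model the two samples have different residual values but the same slope, which mirrors the separation between potential disorder and interaction disorder described in the text.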

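We can also consider the case where $v = 0$ but $g \neq 0$ and $g' \neq 0$. In this
case, we have (at $T = 0$) (41)

$$
\Pi(i\omega, q) = -c_b \frac{|\omega|}{|q|} - \frac{\pi}{2}\,\mathcal{N}^2 g'^2 |\omega|,
$$
$$
\Sigma(i\omega, \hat{k}) = -i c_f\, \mathrm{sgn}(\omega)\, |\omega|^{2/3}
- \frac{i \mathcal{N} g'^2 \omega}{6\pi} \ln\left(\frac{e\,\Lambda^3}{c_b |\omega|}\right)
\tag{10}
$$

where $\Lambda \sim g^2/(g'^2 v_F \mathcal{N})$ is an ultraviolet (UV) momentum cutoff.
Notably, the disordered interactions induce a marginal Fermi liquid term in $\Sigma$,
which manifests as the first higher-order correction to the translationally invariant
result in Eq. 3 (43).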

It is sufficient in this $v = 0$ but $g \neq 0$ and $g' \neq 0$ case to compute the
conductivity using the theory of modes in the vicinity of antipodal points on the
Fermi surface (44). We then find, as before, that $\sigma_{\Sigma,g}$ and
$\sigma_{V,g}$ cancel and (42)


Nuvi N?vzg? ie eA° 
20 24r0 Co? 
Noy 
12, were : 
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ae »T)= 
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1 Re[o(m>T)] _ 
N N?v2g" 7 


~6 2 
The transport scattering rate is therefore still linear in $|\omega|$ (and hence $T$)
up to logarithms, and there is no residual resistivity when $v = 0$, despite the
presence of disorder in $g'$. This also turns out to be valid to all orders in
perturbation theory in the large $N$ limit (41).


N?g't 


36 


(11) 


Crossovers 


For energy ($E$) scales larger than $E_{c,1} \sim \Gamma^2/(v_F^2 c_d)$ but smaller
than $E_{c,2} \sim g^4/(g'^6 v_F^2 \mathcal{N}^4)$ ($E_{c,1} < E_{c,2}$ because
$\mathcal{N} g'^2 \ll g^2/\Gamma$, as disorder is a correction to the clean system),
the leading frequency dependence of the inelastic part of the fermion self-energy
induced by $g$ changes from $i\omega\ln(1/|\omega|)$ to
$i\,\mathrm{sgn}(\omega)|\omega|^{2/3}$, as in the theory with $v = 0$ described above
(41). However, as shown above for $v = 0$, the $|\omega|$ or $T$ dependence of the
transport scattering rate continues to arise from $g'$ and remains linear (up to
logarithms) but with a slope that is ~2/3 of the slope in the $E < E_{c,1}$ theory.
For energy scales larger than $E_{c,2}$, there is an additional crossover to the
theory with $g = 0$ considered in previous studies (32, 34), which also has a linear
$|\omega|$ or $T$ dependence (up to logarithms) of the transport scattering rate, but
now with the same slope as in the $E < E_{c,1}$ theory (41).


Planckian behavior 


Experimental analyses (6, 7) have compared the slope of the linear-$T$ resistivity to
the renormalization of the effective mass in a proximate Fermi liquid and have so
deduced a scattering time $\tau_{\mathrm{tr}}$ appearing in a Drude formula for the
resistivity. In our theory, we obtain

$$
\frac{1}{\tau_{\mathrm{tr}}} = \alpha\, \frac{k_B T}{\hbar}
\tag{12}
$$

The dimensionless number $\alpha$ has been computed previously (32, 34) in the limit
$g' \gg g$ to be $\alpha \approx (\pi/2) \times$ (ratio of logarithms of $T$). For
smaller $g'$, we find (at $v = 0$) (41)


T g? 
ae 7] ‘ Iy9(T) ~—InT 
2g?Ly(T) + fyrLo(T) 


(13) 
Therefore, Planckian behavior [$\alpha \sim O(1)$ and depending only slowly on $T$
and nonuniversal parameters] only occurs in the regime of large $g'$ considered in
previous studies (32, 34). Otherwise, $\alpha < 1$ when $g$ is the largest
interaction coupling. Our theory therefore provides a concrete realization of the
often-conjectured Planckian bound of $\alpha < 1$ on the transport scattering times
of quantum critical metals (1, 5, 8). It is worth noting that quantum critical
$T$-linear resistivity with $\alpha < 1$ has been observed recently in experiments on
heavy fermion materials (7). Finally, for $v = 0$ but $g \neq 0$, $\alpha \ll 1$ and
has a power law dependence on $T$; therefore, there is manifestly no Planckian
behavior in this case.
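The experimental procedure described above, in which a scattering time is deduced from the slope of the linear-$T$ resistivity and an effective mass, can be sketched with a simple Drude model. The code below assumes $\rho = m^*/(n e^2 \tau_{\mathrm{tr}})$ with $1/\tau_{\mathrm{tr}} = \alpha k_B T/\hbar$, so that $\alpha = (n e^2 \hbar / m^* k_B)\, d\rho/dT$. The carrier density, mass ratio, and function names are hypothetical, and the three-dimensional SI form is an assumption made for illustration.

```python
HBAR = 1.054571817e-34      # J s
K_B = 1.380649e-23          # J / K
E_CHARGE = 1.602176634e-19  # C
M_E = 9.1093837015e-31      # kg

def drude_resistivity(alpha, temperature, density, mass_ratio):
    """rho = m* / (n e^2 tau), with 1/tau = alpha * k_B * T / hbar (SI units)."""
    rate = alpha * K_B * temperature / HBAR
    return mass_ratio * M_E * rate / (density * E_CHARGE ** 2)

def alpha_from_slope(resistivity_slope, density, mass_ratio):
    """Invert the Drude relation: alpha = n e^2 hbar / (m* k_B) * d(rho)/dT."""
    return resistivity_slope * density * E_CHARGE ** 2 * HBAR / (mass_ratio * M_E * K_B)

# Hypothetical material parameters, used only for a round-trip check.
density = 1.0e28    # carriers per cubic meter
mass_ratio = 5.0    # m* / m_e
slope = drude_resistivity(0.7, 1.0, density, mass_ratio)  # ohm meter per kelvin
recovered_alpha = alpha_from_slope(slope, density, mass_ratio)
print(f"alpha = {recovered_alpha:.3f}")
```

A value of $\alpha$ of order one obtained this way is what is referred to as Planckian behavior, whereas $\alpha$ well below one corresponds to the regime in which $g$ is the largest coupling.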


Scalar mass disorder 

Finally, we consider spatial disorder in the scalar mass $m_b$ and argue that it does
not modify our results over substantial intermediate scales. Such a term is not
allowed for emergent gauge fields, but it can appear as a fluctuation in the position
of the quantum critical point for the cases where $\phi$ is a symmetry-breaking order
parameter


Sa = far ale wy (L)O,(¥, 1); (Fr, 7) 
(14) 
with 
5 (F)@m(¥") = oe — ¥')(8u8im + Bindu) 


(15) 


The large $N$ analysis shows that $S_w$ is strongly relevant, so $w$ may well be a
substantial source of spatial disorder in experimental systems. Consequently, it is
appropriate to account for $S_w$ first by transforming to the basis of eigenmodes of
$\phi$, which are eigenstates of the harmonic terms for $\phi$ in a given disorder
realization. In this basis, we obtain a theory that has the same form as
$S_g + S_v + S_{g'}$ with additional spatial disorder in the couplings, including in
$K$. However, it is not difficult to show that spatial disorder in $K$ is
unimportant. So, $S_w$ can be absorbed in a renormalization of the values of $v$ and
$g'$, and we can continue to use our results for $S_g + S_v + S_{g'}$. A more thorough
analysis of disorder fluctuation effects is required to determine whether this
transformation remains valid at the longest scales near the quantum critical point.


Discussion and outlook 


A phenomenologically attractive feature of our 
theory is that the residual resistivity and the 
slope of the linear-T resistivity are determined by 
different types of disorder: the potential disorder $v$ (which determines the elastic
scattering rate $\Gamma$) and the interaction disorder $g'$ (which determines the
inelastic self-energy in the last term of Eq. 7), respectively. We obtain a marginal 
Fermi liquid electron self-energy (18) as is often 
observed in quantum critical metals (45). 

Because the coupling g’ is spatially random, 
momentum is not conserved at its Yukawa in- 
teraction vertex. The physical properties there- 
fore remain unchanged for order parameters 
at nonzero momentum and for theories with 
multiple Fermi surfaces. 

Our calculations were done in the large N 
limit; we argue that computing all diagrams 
directly at N = 1 would have led to the same 
crucial cancellations. The large N mainly serves 
to systematically select a consistent set of di- 
agrams to resum from the saddle point of an 
effective action. Furthermore, the large N or 
Eliashberg theory agrees well with quantum 
Monte Carlo (QMC) studies in the clean limit 
(carried out with the number of fermion or 
boson flavors of order one) (46-49) and does 
not have a potentially destabilizing Schwarzian 
zero mode (34). A comparison with QMC for 
the disordered case requires substantially more 
advanced computational techniques and is the 
subject of ongoing work (50). The mechanism 
in the work of Shi et al. (22) for the noncommutativity 
of the large $N$ and small $\omega$ limits 
applies only for order parameters with the 
same symmetry as the momentum in the spa- 
tially uniform case and does not apply for the 
spatially disordered case, which has no con- 
served momentum and for which the patches 
do not decouple. 

Our theory of the influence of spatial disor- 
der includes some disorder terms to all orders, 
and this yields the z = 2 diffusive scalar prop- 
agator. This is in contrast to the perturbative 
disorder analysis of earlier memory function 
treatments (16, 20). 

Unlike earlier approaches (2) to construct- 
ing controlled theories of strongly correlated 
metals with low-temperature $T$-linear resistivity, 
there is no local criticality in our theory. 
The quantum critical scalar fluctuations live in 
two—and not zero—spatial dimensions. 

When the values of the interaction couplings and $T$ are large enough to make the
fermion self-energy $\Sigma$ comparable to the Fermi energy, we expect the theories
described in this work to cross over into a so-called bad metal regime (51). It would
be interesting to examine the transport properties of such a regime.

Chemical remodeling of a cellular chaperone to target the active state of mutant KRAS


Christopher J. Schulze, Kyle J. Seamon, Yulei Zhao, Yu C. Yang, Jim Cregg, Dongsung Kim,
Aidan Tomlinson, Tiffany J. Choy, Zhican Wang, Ben Sang, Yasin Pourfarjam, Jessica Lucas,

The discovery of small-molecule inhibitors requires suitable binding pockets on
protein surfaces. Proteins that lack this feature are considered undruggable and
require innovative strategies for therapeutic targeting. KRAS is the most frequently
activated oncogene in cancer, and the active state of mutant KRAS is such a
recalcitrant target. We designed a natural product-inspired small molecule that
remodels the surface of cyclophilin A (CYPA) to create a neomorphic interface with
high affinity and selectivity for the active state of KRAS^G12C (in which glycine-12
is mutated to cysteine). The resulting CYPA:drug:KRAS^G12C tricomplex inactivated
oncogenic signaling and led to tumor regressions in multiple human cancer models.
This inhibitory strategy can be used to target additional KRAS mutants and other
undruggable cancer drivers. Tricomplex inhibitors that selectively target active
KRAS^G12C or multiple RAS mutants are in clinical trials now (NCT05462717 and
NCT05379985).


KRAS is a small guanosine triphosphatase (GTPase) that cycles between an inactive
[guanosine diphosphate (GDP)-bound, OFF] state and an active (GTP-bound, ON) state
(1). Active KRAS binds to and activates several effector proteins to regulate cell
growth and proliferation (2). KRAS mutations act as oncogenic drivers that stimulate
excessive downstream signaling and proliferation (3). KRAS^G12C (in which glycine-12
is mutated to cysteine) is the most frequent KRAS mutation in non-small-cell lung
cancer (NSCLC) (4, 5), and 37 to 43% of patients with NSCLC who harbor this variant
respond to treatment with inhibitors that target the inactive state of KRAS^G12C,
such as sotorasib and adagrasib (6-9). The clinical benefits of these agents
represent an important advance in precision oncology. Nevertheless, both are limited
with regard to the depth and duration of response. Although the reasons for such
limitations are multifactorial, cancer cells appear to bypass inactive
state-selective inhibition by increasing the amount of drug-insensitive, GTP-bound
KRAS^G12C (10-15).



To date, efforts to target the active state of 
KRAS by traditional small-molecule drug dis- 
covery strategies have been unsuccessful, sug- 
gesting that innovative approaches are needed 
for its inhibition. One such approach is inspired 
by natural products such as rapamycin and 
FK506, which engage the immunophilin FKBP12 
to inhibit mechanistic target of rapamycin 
(mTOR) or calcineurin, respectively (16-18). Al- 
though the targets of these natural products are 
dictated by evolution (19), we used structure- 
based redesign of an immunophilin ligand to 
direct its paired endogenous immunophilin to 
target the active state of mutant KRAS. 


Results 
Remodeling cyclophilin A to generate a 
neomorphic interface that binds to active KRAS 


We began by focusing on the immunophilin
cyclophilin A (CYPA) because of its favorable
electrostatic surface charge complementarity
with residues on the effector binding interface
of KRAS, which is not a feature of FKBP12 (fig.
S1A). Sanglifehrin A (fig. S1B) is a natural product
known to bind CYPA with high affinity
(20). A tool ligand (compound-1) containing a
minimal CYPA-binding motif of sanglifehrin A
and a promiscuous cysteine-reactive warhead
(Fig. 1, A and B) was synthesized to facilitate
covalent tethering and de novo tricomplex
formation. The crystal structure of the primitive
CYPA:compound-1:KRAS^G12C tricomplex
(1.40 Å; Fig. 1A and table S1) suggested that
macrocyclization of the ligand (fig. S1C) would
decrease conformational entropy and increase
protein contacts. Macrocyclization through a
substituted indole linker introduced specific
[...], and the phenolic hydroxyl was removed to
reduce the number of hydrogen-bond donors
and acceptors. The resulting cyclized core was
modified with either an acetamide moiety that
enables reversible binding (compound-2; Fig.
1B) or an acrylamide warhead to covalently engage
the C12 residue (compound-3; Fig. 1B). As
expected, compound-2 induced CYPA complexes
with guanosine-5′-[(β,γ)-imido]triphosphate
(GMPPNP)-bound wild-type KRAS (KRAS^WT)
as well as KRAS^G12C [half-maximal inhibitory
concentration (IC50): 510 or 180 nM, respectively;
Fig. 1C]. By comparison, compound-3 had increased
potency for inducing CYPA complexes
with active KRAS^G12C (IC50: 52 nM) while retaining
weak reversible binding to KRAS^WT
(IC50: 520 nM). Compound-3 was modified to
further increase the selectivity for KRAS^G12C.
Replacement of the acrylamide in compound-3 with
an ynamide warhead generated compound-4,
which had greater potency and selectivity for
KRAS^G12C-CYPA tricomplex formation (IC50:
28 nM, 39-fold more selective; Fig. 1C). Owing to
the limit of detection in the tricomplex formation
assay (see materials and methods), we elected
to further optimize the warhead and linker using
a kinetic target engagement assay, which affords
higher resolution for potent compounds. The
maximal rate of KRAS^G12C target engagement
for compound-4 was 26-fold higher than that of
compound-3 (Fig. 1, B and D, and fig. S1D).
Installation of a conformationally optimized linker
improved the maximal rate a further 14-fold and
resulted in RMC-4998 (Fig. 1, B and D, and fig.
S1D). The covalent engagement efficiency of RMC-4998
[the ratio of the rate constant of enzyme inactivation
(k_inact) to the inhibition constant (K_I);
k_inact/K_I = 272,000 M⁻¹ s⁻¹] was
greater than that of existing inactive state-selective
inhibitors (21-23), even though RMC-4998
targets the active or guanosine triphosphate
(GTP)-bound state of KRAS^G12C.
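
The selectivity comparisons in this paragraph reduce to ratios of the reported IC50 values. The following is a minimal sketch of that arithmetic, assuming fold selectivity is simply the wild-type IC50 divided by the G12C IC50; the function name `fold_selectivity` and the dictionary layout are illustrative and are not taken from the study's analysis.

```python
def fold_selectivity(ic50_wt_nm: float, ic50_g12c_nm: float) -> float:
    """Fold selectivity for G12C-mutant over wild-type KRAS (ratio of IC50 values)."""
    if ic50_wt_nm <= 0 or ic50_g12c_nm <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_wt_nm / ic50_g12c_nm


# IC50 values (nM) as quoted in the text for tricomplex formation.
ic50_nm = {
    "compound-2": {"WT": 510.0, "G12C": 180.0},
    "compound-3": {"WT": 520.0, "G12C": 52.0},
}

selectivity = {
    name: fold_selectivity(values["WT"], values["G12C"])
    for name, values in ic50_nm.items()
}

for name, fold in selectivity.items():
    print(f"{name}: {fold:.1f}-fold selective for G12C over wild type")
```

Under this assumption the reversible compound-2 is under 3-fold selective and the covalent compound-3 is 10-fold selective. The 39-fold figure for compound-4 is quoted directly in the text, which does not give its wild-type IC50.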

We next sought to determine the structural
basis for tricomplex formation and KRAS inhibition.
RMC-4998 bound to CYPA reversibly
to form a low-affinity binary complex [dissociation
constant (K_d) = 1.09 µM, dissociation rate
constant (k_off) = 1.0 s⁻¹; Fig. 1E and fig. S1E].
The crystal structure of the mature CYPA:RMC-4998:KRAS^G12C-GMPPNP
tricomplex (1.53 Å;
table S1) revealed that the aforementioned
chemical optimizations increased the number
of contacts between RMC-4998 and CYPA, compared
with those observed in the primitive complex
(fig. S2, A and B). RMC-4998 retained the
piperazic acid and peptidic linker of compound-1
to maintain interactions with Q63 and the
N102 backbone, but the substituted indole in
the macrocycle of RMC-4998 introduced an
additional cation-π interaction with R55 as
well as numerous favorable hydrophobic contacts
(fig. S2, A to G). Through these interactions,
RMC-4998 remodeled the molecular

Fig. 1. Development of a tricomplex inhibitor that targets the active state
of KRAS^G12C. (A) Tertiary structure of the primitive CYPA:compound-1:KRAS^G12C-GMPPNP
complex at 1.40-Å resolution. (B) The chemical structure of
compound-1 and the macrocyclized scaffold for subsequent compounds, with
the sanglifehrin-derived CYPA-binding moiety shown in blue. R denotes divergent
chemical moieties comprising noncovalent (black) and covalent (purple)
derivatives. (C) The interaction between the indicated KRAS proteins (50 nM)
and CYPA (20 µM) in the presence of increasing compound concentrations was
determined by time-resolved fluorescence resonance energy transfer (TR-FRET)
(mean ± SEM, n = 3, N = 2). (D) The rate (k_obs) of covalent modification of
KRAS^G12C in the presence of CYPA (25 µM) and the indicated compounds (mean
± SEM, N = 3). (E) Schematic of tricomplex formation by RMC-4998 along with
rate constants for each reaction. (F to H) Mapped pairwise atomic distances
between structures of apo-CYPA [Protein Data Bank (PDB) ID 3K0N (30)] and
RMC-4998-bound CYPA showing the structural rearrangements (white-to-green
gradient) that gave rise to the high-affinity neomorphic binding interface for
KRAS^G12C (F). Interactions between RMC-4998, CYPA, and the switch I (G) or
switch II (H) regions of KRAS^G12C in the crystal structure of the mature tricomplex (1.53 Å).
Dashed lines indicate interactions as per the key in (A). (I) The interaction between
KRAS^G12C (2 µM), RMC-4998 (10 µM), and CYPA (2 µM) was determined by native
polyacrylamide gel electrophoresis (PAGE). (J) As in (I), but KRAS^G12C was loaded with
nonhydrolysable GTPγS (active state) or GDP (inactive state). A representative
of at least two independent experiments is shown in (I) and (J). (K) Human embryonic
kidney 293 (HEK293) cells coexpressing small-bit luciferase-tagged RAS variants and
large-bit luciferase-tagged CYPA were treated with RMC-4998 (100 nM) for 2 hours
followed by determination of luciferase activity (mean ± SEM, N = 4). FC, fold change.
Single-letter abbreviations for amino acid residues are as follows: A, Ala; C, Cys;
D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln;
R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. For all figures, unless otherwise
indicated, N and n denote biological and technical replicates, respectively.


surface of CYPA to create a neomorphic interface
with affinity for KRAS^G12C (Fig. 1F and
movie S1).
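
For a simple reversible 1:1 binding model, K_d = k_off / k_on, so the binary-complex constants reported for RMC-4998 and CYPA imply an association rate constant. The sketch below assumes that simple model holds, which the text does not state; the function and variable names are illustrative.

```python
def association_rate_constant(k_off_per_s: float, k_d_molar: float) -> float:
    """Return k_on (M^-1 s^-1) from K_d = k_off / k_on for simple 1:1 binding."""
    if k_off_per_s <= 0 or k_d_molar <= 0:
        raise ValueError("rate and dissociation constants must be positive")
    return k_off_per_s / k_d_molar


K_D_MOLAR = 1.09e-6  # 1.09 µM, reported for RMC-4998 binding to CYPA
K_OFF_PER_S = 1.0    # 1.0 s^-1, reported dissociation rate constant

k_on = association_rate_constant(K_OFF_PER_S, K_D_MOLAR)
print(f"implied k_on = {k_on:.2e} M^-1 s^-1")
```

Under this assumption the implied k_on is roughly 9 × 10^5 M⁻¹ s⁻¹. The fast off-rate is consistent with the text's description of the binary complex as low affinity and reversible.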

The covalent engagement of KRAS^G12C by
CYPA:RMC-4998 led to the formation of a
stable tricomplex (Fig. 1E) with a half-life that
was longer than 8 hours after compound
washout in biophysical assays (fig. S3A) and
cellular experiments (fig. S3B). The crystal
structure of the tricomplex revealed several
key interactions that enable stable binding
(Fig. 1, G and H): (i) Noncovalent contacts between
RMC-4998 and the switch I and II motifs
of KRAS^G12C. These were mediated by the
indole introduced during macrocyclization
and included lipophilic interactions with I36
on the switch I motif (Fig. 1G) and a π-π interaction
with Y64 on the switch II motif (Fig.
1H) of KRAS. (ii) Neomorphic contacts between
CYPA and KRAS^G12C, including hydrogen
bonds between N71, K151, and R148 in
CYPA and E31, D33, and E37 in the switch I
motif of KRAS (Fig. 1G), as well as W121 in
CYPA and Y64 in KRAS (Fig. 1H). (iii) Covalent
modification of C12 by the ynamide warhead
in RMC-4998 (Fig. 1H and fig. S2D).
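
The reported half-life can be restated as a bound on a rate constant. As a minimal sketch, assuming loss of the tricomplex follows first-order kinetics (an assumption made here for illustration, not a claim from the study), a half-life longer than 8 hours corresponds to an effective rate constant below ln(2) / t_half:

```python
import math


def rate_constant_from_half_life(t_half_hours: float) -> float:
    """First-order rate constant (s^-1) corresponding to a given half-life in hours."""
    if t_half_hours <= 0:
        raise ValueError("half-life must be positive")
    return math.log(2) / (t_half_hours * 3600.0)


# A half-life longer than 8 hours means the effective rate constant is below this value.
k_upper_bound = rate_constant_from_half_life(8.0)
print(f"effective dissociation rate constant < {k_upper_bound:.2e} s^-1")
```

This bound is several orders of magnitude below the 1.0 s⁻¹ off-rate reported for the binary RMC-4998:CYPA complex, which illustrates how much the covalent tricomplex outlasts the reversible binary interaction.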

The remodeled molecular surface of inhibitor-bound
CYPA enabled selective and active state-specific
binding to KRAS^G12C. All three components
were required for tricomplex formation
(Fig. 1I and fig. S4A); neither RMC-4998
nor CYPA was able to bind to KRAS^G12C alone.
No complex was detected with GDP-loaded
KRAS^G12C, suggesting a selectivity for the active
(GTP-bound) state of the mutant (Fig. 1J). RMC-4998
induced a tricomplex between KRAS^G12C
and CYPA in live cells without affecting wild-type
KRAS, NRAS, or HRAS (Fig. 1K). RMC-4998
had comparable activity on G12C-mutant

Fig. 2. Structural constraints for tricomplex formation and active state-selective
KRAS inhibition. (A) Superimposition of the structures of the CYPA:RMC-4998:KRAS^G12C
tricomplex and the KRAS:CRAF RBD/CRD complex (PDB ID 6XI7)
(31). (B) The effect of RMC-4998 on the interaction between the indicated KRAS
proteins (12.5 nM) and BRAF RBD (50 nM) in either the presence or the absence of
CYPA was determined by TR-FRET (mean ± SD, N = 4). (C and D) HEK293
cells coexpressing small-bit luciferase-tagged KRAS^G12C and large-bit luciferase-tagged
CYPA (C) or full-length CRAF (D) were treated with RMC-4998 (100 nM)
followed by determination of reconstituted luciferase activity in live cells (mean ±
SEM, N = 3). (E) The indicated parental, CYPA-null, or CYPA-rescued H358 cells


HRAS, NRAS, and KRAS, with little, if any,
activity on KRAS^G13C (fig. S4, B and C).


Tricomplex formation disrupts effector binding 
to active KRAS 


Structural superpositions suggested that when
bound in the tricomplex, CYPA occludes the
effector binding interface of KRAS^G12C, including
that occupied by the RAS-binding and
cysteine-rich domains (RBD and CRD, respectively)
of CRAF (Fig. 2A), the catalytic subunit
(p110) of phosphatidylinositol 3-kinase (PI3K)
(fig. S5A), the allosteric site of the catalytic
domain of SOS1 (SOScat; fig. S5B), and the RAS-interacting
domain (RID) of RAL guanine nucleotide
dissociation stimulator (RALGDS; fig.
S5C). Indeed, the formation of the CYPA:RMC-4998:KRAS^G12C
tricomplex led to a concentration-dependent
dissociation of the BRAF RBD and
RALGDS RID from mutant KRAS in biochemical
assays (Fig. 2B and fig. S5D). In a live-cell kinetic
assay, RMC-4998 treatment led to rapid association
of KRAS^G12C with CYPA, an effect that
was paralleled by the dissociation of full-length
CRAF from KRAS^G12C (Fig. 2, C and D). CYPA was
indispensable for inhibition by RMC-4998, as
evidenced in biochemical assays (Fig. 2B) as well
as in CYPA-null cells and those with rescued expression
(Fig. 2E and fig. S6, A and B). By comparison,
CYPA loss had no effect on the cellular
activity of the inactive state-selective KRAS^G12C
inhibitor adagrasib (fig. S6, C and D). It is possible
that a dependency on CYPA expression could
limit the therapeutic utility of tricomplex inhibitors
like RMC-4998. This is unlikely to be true,
however, when considering that CYPA is highly
abundant across various cancer types (fig. S6E),
has low interpatient variation in expression (fig.
S6F), and has higher expression in tumors compared
with normal tissue (fig. S6G).


Functional interrogation of the neomorphic 
binding interface 


The distinct inhibitory mechanism of RMC-4998
prompted an unbiased evaluation of the
functional role of the amino acid residues in
the neomorphic binding interface between CYPA
and KRAS^G12C. To this end, KRAS^G12C mutant
cells were infected with a doxycycline (dox)-inducible
cDNA library, encoding all possible

were treated as shown and their extracts were analyzed by RBD-pulldown and
immunoblotting to determine the effect on KRAS activation. A representative of
two independent experiments is shown. sgCYPA, CYPA-targeting single guide RNA;
sgNT, nontarget single guide RNA. (F) KRAS mutant cells were infected with a
dox-inducible saturation mutagenesis library based on a KRAS^G12C backbone and
treated with either DMSO or RMC-4998 (100 nM) for 2 weeks. Shown is the log(fold
change) (logFC) in abundance relative to the start time (mean ± 95% confidence interval,
N = 3) for variants meeting the threshold for statistical significance (see materials and
methods). (G) The effect of the indicated CYPA variants on tricomplex formation in live
cells was determined as in (C) (mean ± SEM, N = 3). LFC, log2(fold change).


amino acid substitutions in KRAS^G12C, followed
by treatment with either dimethyl sulfoxide
(DMSO) or RMC-4998 for 2 weeks in the presence
or absence of dox (fig. S7A). The analysis
confirmed the importance of the KRAS C12,
I36, and E37 residues (Fig. 2F), as predicted by
the structural studies described in the previous
sections (Fig. 1, G and H). The screen, however,
revealed that mutations in additional residues,
including those at G13, P34, and A59, attenuated
the effect of RMC-4998 (Fig. 2F and fig.
S7B). The latter likely disrupt the conformation
of the switch regions to prevent CYPA:RMC-4998
from engaging active KRAS^G12C.
Cells expressing KRAS^G12C that contains a P34R
(P34→R), I36Q, or A59G secondary mutation
confirmed the ability of these variants to activate
extracellular signal-regulated kinase
(ERK) signaling in the presence of RMC-4998
(fig. S7C). Secondary in-cis mutations on KRAS^G12C
(for example, R68, H95, and Y96) confer resistance
to adagrasib and/or sotorasib (11-13).
RMC-4998 occupies a distinct binding site and
does not contact any of these residues, suggesting
that it should retain inhibitory activity
against these mutants (Fig. 1G). Indeed, mutations
in these residues were not identified
in the saturation mutagenesis screen (fig.
S7A) and had no effect on displacement of the
CRAF RBD by RMC-4998 in a live-cell assay
(fig. S7D).
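
The screen readout is a log fold change in variant abundance relative to the start time. The sketch below shows one common way such a value can be computed from sequencing read counts. It is illustrative only: the normalization to library size, the pseudocount, and the example counts are hypothetical assumptions, and the study's actual pipeline is described in its materials and methods.

```python
import math


def log2_fold_change(
    count_end: float,
    count_start: float,
    total_end: float,
    total_start: float,
    pseudocount: float = 0.5,
) -> float:
    """log2 ratio of a variant's read frequency at the end vs. the start of a screen.

    Counts are normalized to total library reads so that sequencing depth does not
    bias the ratio; the pseudocount (a hypothetical choice) avoids division by zero.
    """
    if total_end <= 0 or total_start <= 0:
        raise ValueError("library totals must be positive")
    freq_end = (count_end + pseudocount) / total_end
    freq_start = (count_start + pseudocount) / total_start
    return math.log2(freq_end / freq_start)


# Hypothetical read counts for two variants under drug treatment.
enriched = log2_fold_change(count_end=400, count_start=100, total_end=1000, total_start=1000)
depleted = log2_fold_change(count_end=50, count_start=200, total_end=1000, total_start=1000)
print(f"enriched variant logFC = {enriched:.2f}, depleted variant logFC = {depleted:.2f}")
```

In a resistance screen of this kind, variants that blunt the drug's effect would be expected to show a higher logFC under RMC-4998 than under DMSO.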

We next used the live-cell biosensor to interrogate
residues in CYPA that contribute
to the neomorphic interface. Alanine mutations
at the N71, K151, and R148 residues of
CYPA that form direct contacts with KRAS^G12C
resulted in a loss of tricomplex formation
(Fig. 2G). Mutations in their interacting partners
on switch I of KRAS^G12C (E31A, D33A,
and E37A) also abolished the formation of
the tricomplex, highlighting the importance
of these residues in binding (Fig. 2G). These
mutations had a minimal effect on the enzymatic
proline isomerase activity of CYPA
(fig. S8).


Selective inhibition of oncogenic signaling 
and proliferation 


Disruption of effector binding by RMC-4998 inhibited
downstream ERK signaling in KRAS^G12C
mutant cancer cell models, with an IC50 ranging
between 1 and 10 nM (Fig. 3A). RMC-4998
treatment also attenuated AKT-mTOR and
RAL signaling (fig. S9, A and B). The latter
pathways, however, had residual activity despite
the presence of the drug, suggesting that
they are only partially activated by mutant
KRAS in cancer cells. Targeting the GTP-bound
state of KRAS^G12C with the tricomplex inhibitor
RMC-4998 (0.1 µM) led to a faster disruption
of the interaction between KRAS^G12C and CRAF
compared with inactive state-selective inhibitors
adagrasib (1 µM) and sotorasib (10 µM)
(Fig. 3B). A similar disruption was observed
for the interaction between KRAS^G12C and
SOScat (fig. S9C). Complete covalent modification
of KRAS^G12C in RMC-4998-treated MIA
PaCa-2 cells occurred within 5 min, again faster
than that observed with adagrasib (fig. S9D).
The kinetics of inactive state-selective inhibition
are limited by the rate of GTP hydrolysis
by KRAS^G12C (24), which RMC-4998 overcomes
by directly targeting the active state.

Another limitation of inactive state-selective
inhibitors is their susceptibility to cellular
stimuli (10) that drive nucleotide exchange to
induce GTP loading of KRAS^G12C. This is particularly
evident after receptor tyrosine kinase
(RTK) activation (25, 26). Indeed, exposure to
epidermal growth factor (EGF) or hepatocyte
growth factor (HGF) led to attenuated target
engagement (fig. S9E), ERK inhibition, and
antiproliferative effect by the inactive state-selective
inhibitor adagrasib (Fig. 3C). By contrast,
RTK stimulation had a minimal effect
on RMC-4998 activity. In the absence of growth
factor stimulation, treatment of KRAS^G12C-mutant
cell lines with either RMC-4998 or
adagrasib produced initial pathway suppression
followed by a rebound over the course of
the next 72 hours. The readdition of RMC-4998,
but not adagrasib, suppressed the reactivated
pathway signaling (fig. S9F). Therefore, targeting
of the active state KRAS^G12C through tricomplex
formation has distinct biological properties
from existing inactive state-selective clinical-stage
inhibitors.


Potent antitumor activity in cell line and 
patient-derived xenografts 


Further optimization of RMC-4998 led to RMC-6291,
an active state-selective KRAS^G12C inhibitor
now undergoing clinical testing. RMC-4998 and
RMC-6291 have similar chemical structures and
nearly identical kinetic constants for engaging
active KRAS^G12C (fig. S10A). Both RMC-4998 and
RMC-6291 inhibited ERK signaling and induced
apoptosis in KRAS^G12C mutant H358 cells (fig. S10B).
In a cancer cell line panel (Fig. 4A and table S2),
RMC-6291 inhibited the proliferation of KRAS^G12C
mutant cells with a median IC50 of 0.11 nM, or
13,500 times more potently than non-G12C mutant
models, indicating its selectivity index.
RMC-4998 had a median IC50 of 0.28 nM and a
selectivity index of 1450. Of note, both the potency
and the selectivity index of RMC-4998 and RMC-6291
were greater than those of adagrasib. Selective
cellular engagement of KRAS^G12C was
further evidenced by a proteome-wide cysteine-reactivity
assay, wherein RMC-6291 reacted with
KRAS^G12C to a greater degree than it did with
other cellular proteins (fig. S11A and data S1).
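
If the selectivity index is read as the ratio of the median IC50 in non-G12C models to the median IC50 in G12C-mutant models, which is how the text appears to use the term, the reported figures imply a median potency in the non-G12C group. The following is a minimal sketch under that assumption; the function name and data layout are illustrative, and the implied values are derived here rather than reported by the study.

```python
def implied_reference_ic50(ic50_g12c_nm: float, selectivity_index: float) -> float:
    """Median non-G12C IC50 (nM) implied by a G12C IC50 and a selectivity index."""
    if ic50_g12c_nm <= 0 or selectivity_index <= 0:
        raise ValueError("inputs must be positive")
    return ic50_g12c_nm * selectivity_index


# (median G12C IC50 in nM, selectivity index), as quoted in the text.
reported = {
    "RMC-6291": (0.11, 13500.0),
    "RMC-4998": (0.28, 1450.0),
}

implied_non_g12c_nm = {
    name: implied_reference_ic50(ic50, index) for name, (ic50, index) in reported.items()
}

for name, value in implied_non_g12c_nm.items():
    print(f"{name}: implied median non-G12C IC50 of about {value:.0f} nM")
```

On this reading, both compounds would be active against non-G12C lines only in the high-nanomolar to micromolar range, compared with sub-nanomolar potency in G12C-mutant lines.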

Daily oral administration of RMC-6291 in
mice bearing NCI-H358 xenografts was well
tolerated (fig. S11B) and led to near-complete
tumor regression (Fig. 4B). RMC-6291 achieved
dose-dependent plasma concentrations and
inhibition of downstream signaling output, as
assessed by the effect on tumor DUSP6 mRNA
expression (Fig. 4C). A single dose of RMC-6291

Fig. 4. Potent and selective suppression of KRAS^G12C-driven tumor growth
by tricomplex inhibitors. (A) The indicated cells were treated with increasing
concentrations of RMC-6291, RMC-4998, or adagrasib for 120 hours and the effect
on cell viability was determined using the 3D CellTiter-Glo assay. [...]
represents an individual cell line (n = 17 cell lines for G12C, n = [...]
non-G12C), and the gray line indicates the median IC50 for each group of cell lines.
(B) Mice bearing H358 CDX tumors were treated with RMC-6291 at the indicated
dose, administered orally once daily, and the tumor volume was assessed for
28 days (mean ± SEM). ***Adjusted p < 0.001 for RMC-6291 (all dose groups)
versus control, using repeated measures two-way analysis of variance (ANOVA)

led to sustained engagement of KRAS^G12C and
inhibition of ERK phosphorylation in tumors
for ~24 hours and also induced apoptosis, as
evidenced by TUNEL (terminal deoxynucleotidyl
transferase-mediated deoxyuridine triphosphate
nick end labeling) staining (fig.
S12, A to D). These data indicate that the tricomplex
inhibitor RMC-6291 is a potent and
selective inhibitor of the active conformation
of KRAS^G12C in vivo. Similar in vivo activity was
observed with RMC-4998 (fig. S13, A to C).

Next, we evaluated the antitumor activity
of RMC-6291 across a panel of cell line-derived
(CDX) and patient-derived (PDX) xenograft models
of KRAS^G12C mutant NSCLC and colorectal
cancer (CRC). Targeted exome DNA sequencing
revealed that the PDX panels were representative
of the genomic landscape of patients with
KRAS^G12C mutant lung cancer or CRC (fig. S14, A
and B). RMC-6291 treatment resulted in mean tumor
regression in 76% (19/25) of NSCLC models
and in 40% (6/15) of the CRC models tested after
a standard 28-day treatment period (Fig. 4, D and
E, and table S3). These results indicate that RMC-6291
can drive deep antitumor responses after
daily oral administration in preclinical studies.
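
The regression percentages quoted above follow directly from the model counts. As a small worked check (the function name is illustrative, and the pooled figure is computed here for orientation and is not reported in the text):

```python
def regression_rate(n_regressing: int, n_models: int) -> float:
    """Fraction of xenograft models showing mean tumor regression."""
    if n_models <= 0 or not 0 <= n_regressing <= n_models:
        raise ValueError("counts must satisfy 0 <= n_regressing <= n_models, n_models > 0")
    return n_regressing / n_models


nsclc_rate = regression_rate(19, 25)            # NSCLC models, counts from the text
crc_rate = regression_rate(6, 15)               # CRC models, counts from the text
pooled_rate = regression_rate(19 + 6, 25 + 15)  # both panels combined (derived here)

print(f"NSCLC: {nsclc_rate:.0%}, CRC: {crc_rate:.0%}, pooled: {pooled_rate:.1%}")
```

The gap between the two tumor types is visible in the raw counts as well as in the percentages.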


Discussion 


The discovery of RMC-4998 and RMC-6291 as
tricomplex inhibitors of GTP-bound KRAS^G12C
represents the successful reengineering of an
immunophilin-binding natural product to engage
a target previously thought to be undruggable.
Neither CYPA nor its natural product
ligand has been previously reported to interact
with KRAS. Their complex formation was
the product of purposeful, structure-guided
chemical modifications to mold a high-affinity
binding interface between CYPA and active
KRAS^G12C. The target-directed approach described
here differs from other pharmacologic
or proteomic screens that describe the identification
of rapamycin or sanglifehrin analogs
that engage additional targets besides mTOR
or CYPA, respectively (27, 28). Selective disruption
of effector binding to the active state
of KRAS^G12C by the recruitment of CYPA is a
distinct inhibitory mechanism that promises
to overcome some of the limitations of inactive
state-selective KRAS^G12C inhibitors while maintaining
selective target engagement and a potentially
wide therapeutic index. A phase 1/1B
clinical trial testing RMC-6291 (NCT05462717)
is ongoing at present.

The tricomplex inhibitory strategy described 
here has broad implications for cancer ther- 
apy. It can be used to develop inhibitors of 
additional oncogenic RAS mutants through 
the introduction of distinct covalent warheads 
or functional groups that can selectively target 

(n = 6 mice per group for control, n = 8 mice per group for RMC-6291); values were
adjusted based on multiple comparison via Dunnett's test on the final tumor
measurement. (C) The unbound plasma concentration of RMC-6291 and the
expression of DUSP6 mRNA in H358 tumors after administration of a single oral
dose of RMC-6291 at the indicated doses (mean ± SEM, n = 3 replicate mice).
(D and E) The indicated NSCLC (D) or CRC (E) xenograft model mice were treated
with RMC-6291 (200 mg per kg of body weight administered orally once daily) to
determine the effect on mean tumor growth or regression after 28 ± 2 days (percent
change from baseline, mean ± SEM, n as indicated in table S3). The dashed line
indicates a 10% reduction in tumor volume from baseline.


other RAS mutants (G13C, G12D, etc.) in an 
allele-specific manner. Alternatively, further 
optimization of the reversible drug binding 
capacity demonstrated by compound-2 would 
enable the concurrent (nondiscriminatory) inhibition
of multiple RAS oncoproteins. A reversible
tricomplex inhibitor RMC-6236 (29)
(RAS^MULTI) developed through this approach is
also in a phase 1/1B clinical trial (NCT05379985).
Although its broader applicability requires 
further investigation, the reengineering of nat- 
ural products to create neomorphic binding 
interfaces on their paired immunophilins may 
prove effective at targeting additional “un- 
druggable” cancer drivers, even beyond RAS. 

MyoD-family inhibitor proteins act as auxiliary
subunits of Piezo channels


Zijing Zhou, Xiaonuo Ma, Yiechang Lin, Delfine Cheng, Navid Bavi, Genevieve A. Secker,
Jinyuan Vero Li, Vaibhao Janbandhu, Drew L. Sutton, Hamish S. Scott, Mingxi Yao,
Richard P. Harvey, Natasha L. Harvey, Ben Corry, Yixiao Zhang, Charles D. Cox


Piezo channels are critical cellular sensors of mechanical forces. Despite their large size, ubiquitous 
expression, and irreplaceable roles in an ever-growing list of physiological processes, few Piezo channel-binding
proteins have emerged. In this work, we found that MyoD (myoblast determination)-family inhibitor
proteins (MDFIC and MDFI) are PIEZO1/2 interacting partners. These transcriptional regulators bind to 
PIEZO1/2 channels, regulating channel inactivation. Using single-particle cryogenic electron microscopy, we 
mapped the interaction site in MDFIC to a lipidated, C-terminal helix that inserts laterally into the PIEZO1 
pore module. These Piezo-interacting proteins fit all the criteria for auxiliary subunits, contribute to 
explaining the vastly different gating kinetics of endogenous Piezo channels observed in many cell types, 
and elucidate mechanisms potentially involved in human lymphatic vascular disease. 


To decode mechanical cues, cells are endowed
with a palette of molecular force
sensors. Among these sensors, Piezo ion
channels (1) have emerged as critical force
sensors that participate in determining
how cells sense their physical environment.
Piezo channels assemble as trimers that possess
all the structural requirements for mechanosensitivity
(2, 3). However, native PIEZO1
channels can display nonuniform subcellular
localization (4-7) and exhibit different
gating kinetics, principally slower inactivation
(1, 8-13), in many cell types when compared
with heterologous expression systems.
These observations could be explained by differences
in lipid composition (14, 15), curvature-dependent
sorting (5, 7), or protein-protein
interactions. Many ion channels interact with
auxiliary subunits (16, 17) to modify their
cellular location and gating properties. Ion-channel
auxiliary subunits are commonly defined
by four criteria: (i) they are non-pore-forming
subunits, (ii) they have a direct and stable interaction
with the pore-forming subunit, (iii)
they modulate channel properties in heterologous
systems, and (iv) they regulate endogenous
channel activity in native cells (17). Despite
substantial effort, to our knowledge, no Piezo
channel-binding partners or auxiliary subunits
have emerged that fit these criteria.


Identification of Piezo channel-binding proteins 


To identify binding partners for Piezo channels,
we used two complementary affinity-capture
mass spectrometry (AC-MS) strategies in conjunction
with two CRISPR-Cas9-edited, PIEZO1-expressing,
human dermal fibroblast (hDF)
lines (Fig. 1, A to F, and fig. S1). The reason we
chose fibroblasts lies in the previously reported 
slower inactivation kinetics of PIEZO1 in this 
cell type (10, 12, 13). Our pipeline consisted of
three comparator groups: (i) hDF with PIEZO1 
ablated with CRISPR-Cas9 (that were compared
with wild-type cells in which PIEZO1
was enriched through a conventional anti- 
PIEZO1 antibody strategy), (ii) hDF with a 
HaloTag added into the endogenous PIEZO1 
loci (P1-Halo) where PIEZO1 was enriched with 
HaloTrap resin, and (iii) primary human car- 
diac fibroblasts (hCF) in which PIEZO1 was 
enriched through a conventional anti-PIEZO1 
antibody strategy. We then stringently ana- 
lyzed the resulting MS data to identify PIEZO1- 
interacting proteins present in all three groups 
that were not present in any of their respective 
negative controls. Using these criteria, we only 
identified two proteins, the first of which was 
PIEZO1 (Fig. 1E). This provided strong val- 
idation for both affinity-capture strategies. 
The second protein identified was the sparsely
studied transcriptional regulator MyoD (myoblast
determination) family-inhibitor domain-containing
protein (MDFIC) (18, 19). MS provided
[...] over the three groups (Fig. 1F).
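
The filtering logic described in this paragraph amounts to a set intersection: keep proteins detected in every comparator group and absent from each group's negative control. The sketch below illustrates that logic. The data structure, the protein names other than PIEZO1 and MDFIC, and the peptide counts are hypothetical, and the threshold of at least two distinct peptides is taken from the Fig. 1E caption.

```python
def candidate_interactors(groups: list[dict], min_peptides: int = 2) -> set[str]:
    """Proteins with enough distinct peptides in every pulldown and absent from its control."""
    if not groups:
        return set()
    per_group_hits = []
    for group in groups:
        # Proteins passing the distinct-peptide threshold in this pulldown.
        detected = {
            protein
            for protein, n_peptides in group["pulldown"].items()
            if n_peptides >= min_peptides
        }
        # Drop anything also seen in the matched negative control.
        per_group_hits.append(detected - set(group["negative_control"]))
    return set.intersection(*per_group_hits)


# Hypothetical example data: protein -> number of distinct peptides identified.
groups = [
    {"pulldown": {"PIEZO1": 40, "MDFIC": 5, "ACTB": 12}, "negative_control": {"ACTB"}},
    {"pulldown": {"PIEZO1": 55, "MDFIC": 4, "TUBB": 9}, "negative_control": {"TUBB"}},
    {"pulldown": {"PIEZO1": 31, "MDFIC": 3, "HSPA8": 1}, "negative_control": set()},
]

hits = candidate_interactors(groups)
print(sorted(hits))
```

Requiring presence in all three groups is a deliberately strict criterion. It trades sensitivity for specificity, which is consistent with only two proteins surviving the filter in the study.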

Using RNA expression data, we identified many 
cell types in addition to fibroblasts that coex- 
press Piezo channels and the MyoD-family in- 
hibitor proteins, MDFIC or MDFI (fig. S2, A and 
B) (20). We then validated the protein-protein 
interaction by expressing N-terminally hemag- 
glutinin (HA)-tagged MDFIC together with 
PIEZO1 in human embryonic kidney 293 cells 
with SV40 large-T antigen (HEK293T). Using a 
coimmunoprecipitation (co-IP) assay, we could 
reciprocally capture PIEZO1 with MDFIC and 
found that the complex was present under 
mechanical (shear stress) or chemical [10 nM 
Yoda-1 (21)] activation of PIEZO1 (fig. S2C), 
which indicated the stability of the interac- 
tion. We also confirmed that PIEZO1 interacted 
with the closely related MDFI (fig. S2D) (22). 
PIEZO2 has high sequence similarity with 
PIEZO1, so we next tested whether MDFIC se- 
lectively interacted with PIEZO1/2 channels 
using native gels. We identified MDFIC at the 
size of the respective PIEZO1/2 trimers but not 
in oligomeric complexes of other structurally 
unrelated channels such as TRPM4 (tetramer) or 
TREK-1 (dimer) (fig. S2E). In doing so, we found 
that PIEZO1 enhanced the protein amounts of 
both MDFIC and MDFI by greater than three- 
to fourfold (fig. S2, C to E). To determine the 
specificity of this effect, we coexpressed MDFIC 
with PIEZO1 and PIEZO2 and compared the 
amount of MDFIC when expressed alongside 
other unrelated ion channels. Coexpression of 
MDFIC with green fluorescent protein (GFP), 
TRPM4, TRPV4, or TREK-1 did not enhance 
the amount of MDFIC (fig. S3, A and B). To
understand the mechanism, we used a cyclo- 
heximide chase assay to observe the degrada- 
tion time course of MDFIC. In cells treated for 
4 to 8 hours with the protein synthesis inhib- 
itor cycloheximide, we observed using Western 
blotting that the protein amount of MDFIC fell 
much more rapidly in the absence of PIEZO1 
(fig. S3, C and D). This suggests that interacting
with PIEZO1 decreased MDFIC turnover. 


MDFIC binds the PIEZO1 pore module 


To provide molecular details of the interaction, 
we coexpressed mouse PIEZO1 (mPIEZO1) with 
N-terminally FLAG-tagged mouse MDFIC and 
purified the complex using FLAG resin. Using 
single-particle cryogenic electron microscopy 
(cryo-EM), we determined the structure of the 
PIEZO1-MDFIC complex at an overall resolution
of 3.66 Å (Fig. 2, A and B, and fig. S4). We
resolved the C-terminal 21 amino acids of MDFIC 
(Fig. 2, A to D), whereas the N-terminal portion 
displayed little or no density, presumably ow- 
ing to local dynamics. The resolved region of 
MDFIC (residues 225 to 246) consists of an 
amphipathic helix that sits parallel to the 
membrane at the membrane interface (Fig. 
2C). This helix inserts laterally into the PIEZO1 
Fig. 1. AC-MS identifies a newly characterized
family of Piezo-channel binding partners.
Groups for AC-MS consisted of: (A) WT and
PIEZO1-edited (knockout, KO) human
dermal fibroblasts (hDF; n = 2), (B) WT and
PIEZO1-HaloTag (P1-Halo) hDF (n = 2), and
(C) primary human cardiac fibroblasts (hCF; n = 2).
(D) Affinity-captured protein lysates were run
on SDS-polyacrylamide gel electrophoresis
(SDS-PAGE) gels and sectioned into quadrants
(red dashed lines). Each quadrant was subjected to
in-gel protein digestion, peptide extraction and
liquid chromatography (LC), and MS, in that order.
(E) Venn diagram illustrating proteins identified in
each experimental group that had ≥2 distinct
peptides that were absent from negative control
replicates. (F) Two proteins were identified in
all positive control replicates: PIEZO1 and MDFIC.
Alignment of distinct MDFIC peptides identified
with MS.


Fig. 2. Structural elucidation of the PIEZO1-MDFIC complex. (A and B) Cryo-EM
density maps of the mouse PIEZO1-MDFIC complex at 3.66-Å nominal resolution
viewed from the top (A) and side (B), with the resolved MDFIC region colored green.
(C) The distal C terminus of MDFIC resides parallel to the bilayer between the anchor
domain (α1 to α3) and the outer helix (OH). The cytoplasmic constriction residues


pore module, nestling between the anchor domain
and the outer helix of PIEZO1 (Fig. 2, C
and D), making contacts with His, Phe,
and Gln residues from the α1 helix of the anchor
domain; Val, Met, and Ile residues from the
outer helix; and two Phe residues at the base
of the inner helix that lines the PIEZO1 pore
(Fig. 2E). The amphipathic helix of MDFIC con- 
sists of a sequence of five cysteines that point 
toward the bilayer interior (Fig. 2, E and F) and 
a sequence of four negatively charged residues 
that point toward the solvent, forming salt 
bridges with multiple lysine residues (Fig. 2, F
and G). The MDFIC C terminus penetrates far 
enough to come close to the cytoplasmic constriction
formed by Met2493 and Phe2494 (Fig.
2H) and the Lys and Arg residues critical for
voltage-dependent inactivation (23, 24). Despite
its central location, MDFIC binding did not
influence the closed structure of the PIEZO1
pore module (fig. S4I).

Because both PIEZO1 and MDFIC are es- 
sential for lymphatic development in mice and 
humans (19, 25, 26), we investigated using the 
Met2493 and Phe2494 are shown in cyan. (D) The C-terminal region of MDFIC
penetrates deep into the pore module of PIEZO1, approaching the inner helix (IH). 
(E to H) The interactions between PIEZO1 and MDFIC from the (E) membrane-facing 
view, (F) lateral view, and (G and H) cytoplasmic-facing view. Variants linked to 
lymphatic malformations (mPIEZO1 V2187 and mMDFIC F245) are labeled in bold. 


ClinVar database whether any disease-causing 
mutations were located within this binding 
interface. We found mutations in both PIEZO1 
(human V2171F; Fig. 2, E and G) and MDFIC
(human F244L; Fig. 2, E and H) associated with 
human lymphatic disease. 


MDFIC and MDFI regulate Piezo gating 


Given the location of MDFIC binding, we next 
tested whether human MDFIC and MDFI
modified human PIEZO1 (hPIEZO1) gating in
HEK293T cells. MDFIC and MDFI expressed
Fig. 3. MyoD-family inhibitor proteins regulate PIEZO1 and PIEZO2 channel
gating. (A to E) Representative cell-attached patch-clamp recordings from
hPIEZO1 (A) control and hPIEZO1 in the presence of (B) hMDFIC, (C) hMDFI,
(D) the conserved C terminus of MDFIC (hMDFIC C-81), and (E) with MDFIC
lacking its C-terminal 20 amino acids (hMDFIC ΔC20) all at a holding potential
of -65 mV. (F to H) Quantification of (F) peak currents per patch, (G) percent
current remaining, and (H) normalized current 1 s after pressure release
(normalized Ipost) for replicates of cell-attached recordings shown in (A) to (E).
(I to L) Representative cell-attached recordings from (I) hPIEZO2 control and 
alone did not generate stretch-activated currents
(fig. S5, A to D). Compared with hPIEZO1
alone, coexpression with MDFIC resulted in a 
mild right shift in the pressure response curve 
[from 14.5 ± 3.2 mmHg (n = 9 cells) to 23.9 ±
3.9 mmHg (n = 6)] (fig. S5, E to G), a marked
increase in the peak stretch-evoked currents, a 
substantial slowing of channel inactivation, 


and continued channel gating even after the 
pressure was released (Fig. 3, A to H, and fig. 
S5). We quantified the latter of these effects 
using the current that remained 1 s after application
of stretch (Ipost). Neither MDFIC nor
MDFI influenced TMEM150c mRNA, demonstrating
that these inactivation effects were independent
of TMEM150c (fig. S5N) (27). MDFIC


did not influence PIEZO1 protein amounts in 
either HEK293T or LNCaP cells transfected 
with MDFIC (fig. S6, A to D) but did increase 
PIEZO1 single-channel conductance from 26 ±
3 pS (n = 4) to 48 ± 8 pS (n = 4) (fig. S6, E to G).
Thus, the increase in stretch-activated cur- 
rents in the presence of MDFIC is likely driven 
by changes in conductance and its strong 
Fig. 4. C-terminal lipidation of MDFIC underlies regulation of PIEZO1 channel
gating. (A) Cryo-EM density map showing extension of density on cysteine residues
[...] the presence of WT MDFIC and mutation of the five C-terminal cysteine residues
to either alanine (5C-A) or serine (5C-S), which abolishes the regulatory effects
effect on inactivation. The slow “closure” after
pressure release—identified in single-channel re- 
cordings (fig. S6, H to J) and from macrocurrents— 
was more pronounced at hyperpolarizing volt- 
ages (fig. S7, A and B), which suggests an effect 
on the structural transition governing voltage-
dependent inactivation (23, 24). Consistent with
this, the channel could be rapidly closed by 
flipping the voltage to depolarizing potentials 
(fig. S7, C and D). We identified slower inactiv- 
ation in both cell-attached and whole-cell modes 
(fig. S8). Additionally, MDFIC expression did 
not influence the function of mechanosensitive 
TREK-1 channels, ruling out nonspecific effects 
(fig. S9). 
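
The kinetic measures used in this section (peak current, percent current remaining, and normalized Ipost) can be illustrated with a short calculation on a sampled current trace. This is a minimal sketch under stated assumptions, not the authors' analysis pipeline: it assumes that percent current remaining is the current at the end of the pressure pulse relative to the peak, and that Ipost is the current 1 s after pressure release normalized to the same peak. The function name `inactivation_metrics` and the trace values are hypothetical.

```python
def inactivation_metrics(times_s, currents_pa, pulse_end_s, post_delay_s=1.0):
    """Return (peak, percent_remaining, normalized_ipost) for one trace.

    Assumptions (not taken from the source):
      - peak is the largest current magnitude up to the end of the pulse;
      - percent_remaining is the magnitude at pulse end relative to peak;
      - normalized_ipost is the magnitude post_delay_s after pulse end,
        divided by peak.
    The nearest sample to each requested time is used.
    """
    if len(times_s) != len(currents_pa) or not times_s:
        raise ValueError("times and currents must be non-empty and matched")
    magnitudes = [abs(i) for i in currents_pa]
    peak = max(m for t, m in zip(times_s, magnitudes) if t <= pulse_end_s)
    if peak == 0:
        raise ValueError("no current detected during the pressure pulse")

    def sample_at(target_s):
        # Index of the sample closest in time to target_s.
        idx = min(range(len(times_s)), key=lambda j: abs(times_s[j] - target_s))
        return magnitudes[idx]

    percent_remaining = 100.0 * sample_at(pulse_end_s) / peak
    normalized_ipost = sample_at(pulse_end_s + post_delay_s) / peak
    return peak, percent_remaining, normalized_ipost


# Hypothetical inward-current trace (pA); the pressure pulse ends at 0.5 s.
times = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 1.0, 1.5]
currents = [0.0, -100.0, -80.0, -60.0, -50.0, -40.0, -20.0, -10.0]

peak, pct_remaining, ipost = inactivation_metrics(times, currents, pulse_end_s=0.5)
print(peak, pct_remaining, ipost)
```

Under these definitions, a slowly inactivating channel gives a larger percent remaining, and continued gating after pressure release gives a larger normalized Ipost.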

The C termini of MDFIC and MDFI are highly
homologous, whereas the N termini bear little 
resemblance and have no known function (fig. 
S10A); therefore, we asked whether the MyoD- 
inhibitor domain of MDFIC (residues 165 to 
246, which we name C-81) expressed alone could 
modify PIEZO1 function. Despite being unsta- 
ble, this domain reduced channel inactivation 
(Fig. 3, D and G, and fig. S10B). Moreover, the
amount of C-81 protein was dramatically in- 
creased when coexpressed with PIEZO1 (fig. 
S10, C and D). Conversely, truncation of the C-
terminal 20 residues (ΔC20) prevented MDFIC
from regulating PIEZO1 gating (Fig. 3, E to G).
This truncation reduced the interaction with
PIEZO1, and MDFIC ΔC20 protein amounts
were only marginally affected by PIEZO1 (fig.
S10, D and E). Thus, functional regulation of 
the PIEZO1-MDFIC complex depends on the 
conserved C terminus of MDFIC. 

Given the homology of PIEZO1 and PIEZO2 
in the MDFIC-binding region, we additionally 
showed that the same regulation occurs for 
both hPIEZO2 and mouse Piezo orthologs 
(Fig. 3, I to L, and fig. S11). Moreover, the ex- 
pression of MDFIC and MDFI correlated with 
the native inactivating phenotype of PIEZO1 
in multiple human and mouse cell lines (fig. 
S12). This included HEK293T, in which fast
PIEZO1 inactivation was seen, and prostate 
cancer cell lines (LNCaP and DU145), in which 
differential stretch-activated kinetics were pre- 
viously reported (9). HEK293T and LNCaP have 
almost undetectable amounts of MDFIC and 
MDFI and display fast inactivation, whereas
DU145 has notable amounts of MDFI at the 
mRNA level and inactivates slowly (fig. S12, A 
to F). Overexpression of MDFIC in LNCaP cells 
markedly slowed inactivation of native PIEZO1 
channels (fig. S12, B to F). In mouse embryonic 
fibroblasts and C2C12 myoblasts, we detected 
MDFIC, and both cell types displayed slow 
PIEZO1 inactivation indicated by larger cur- 
rent remaining at the end of pressure pulses (fig. 
S12, G to I). By contrast, in Neuro2A (N2A) cells, 
PIEZO1 channels exhibited rapid inactivation with 
undetectable expression of MDFIC and MDFI. 

We next asked whether genetic loss of MDFIC 
could modify the kinetics of native PIEZO1 currents.
To investigate this, we used a mouse model
of a complex lymphatic anomaly known as
central conducting lymphatic anomaly, bearing
a truncated variant of MDFIC that lacked
the complete conserved C terminus but retained
the N-terminal region (19). Our assessment of 
PIEZO1 activity in embryonic cardiac fibro- 
blasts isolated from embryonic day 16.5 (E16.5) 
wild-type (WT/WT), heterozygous (WT/M131fs*), 
and homozygous mice (M131fs*/M131fs*) re- 
vealed that fibroblasts harboring C-terminally 
truncated MDFIC had smaller PIEZO1 cur- 
rents that inactivated more rapidly (Fig. 3, 
M to R). This effect on inactivation was pheno- 
copied by silencing MDFIC in fibroblasts 
with small interfering RNA (siRNA) (fig. S13, 
D to J).


PIEZO1 regulation requires MDFIC lipidation 


Cryo-EM maps of MDFIC revealed extra densities
on Cys233, Cys237, Cys240, Cys241, and Cys244
(Fig. 4A). Given that this helix is situated at the 
membrane interface and contains motifs for 
cysteine lipidation (28), we hypothesized that 
these extra densities resulted from lipidation, 
which is the covalent addition of acyl chains to 
amino acids (29). A second cryo-EM complex 
of the MDFIC mutant Cys240Ala (3.68 Å) specifically
lacked the extra density at Cys240, which
is consistent with posttranslational modification
(fig. S14). Acyl biotin exchange confirmed that
MDFIC was palmitoylated (Fig. 4B). Mass tagging 
with polyethylene glycol-modified (PEGylated)
maleimide (which increases mass per palmitoylated
site) showed that there were at least three sites 
for lipidation on MDFIC (Fig. 4C), that two were 
in the distal C terminus (fig. S15), and that like 
most lipidated proteins, MDFIC associated with 
the plasma membrane (Fig. 4D). 

To check whether MDFIC lipidation is func- 
tionally relevant, we mutated all five C-terminal 
cysteines to alanine or serine. MDFIC with 
Cys233/237/240/241/244Ala (5C-A) or Cys233/237/240/241/244Ser
(5C-S) bound to PIEZO1, but neither
mutant influenced PIEZO1 activity, which
suggests that the lipidation of the MDFIC 
C terminus is critical for PIEZO1 regulation 
(Fig. 4, E to H, and fig. S15C). 

To probe the role of lipidation, we used all- 
atom molecular-dynamics simulations of the 
PIEZO1 pore module (residues 1956 to 2547) in
complex with the MDFIC C terminus (Fig. 4, I 
to K, and fig. S16). Simulations consisted of 
three MDFIC monomers: one unmodified, one 
lipidated at all five cysteine residues in the 
helix, and one where each cysteine was mutated 
to serine (5C-S mutants) (Fig. 4I). All variations
of the MDFIC C terminus remained stably bound 
(fig. S16, A and B). An overlay of a snapshot 
from simulations shows that the acyl chains 
coincide with the additional MDFIC cryo-EM 
densities (fig. S16C). We compared all interactions
that occurred in either unmodified or
lipidated MDFIC monomers with those in the 
5C-S mutant, which we know binds to but does
not modulate PIEZO1 function (Fig. 4J and fig.
S16). Interactions between the unmodified and
5C-S MDFIC were indistinguishable, which sup- 
ports the critical role of MDFIC lipidation in 
modulating PIEZO1 function. By contrast, the 
lipidated MDFIC had multiple distinct inter- 
actions with inner-helix residues (Fig. 4, J and 
K, and fig. S16, D and E). Some of these resi- 
dues within the inner helix are critical for in- 
activation, including Leu2475 (30), making 
this the likely pathway for functional modifi- 
cation. Given that MDFIC would need to stay 
bound throughout the PIEZO1 conformational 
cycle, we also examined whether MDFIC could 
bind to the flattened state of PIEZO1 (31). When
MDFIC was aligned to the same pocket, sim- 
ulations showed that it interacted with the 
inner helix and remained stably bound (fig. 
S16, F to H). 


Discussion 


Using AC-MS, we have identified a family of 
Piezo-channel binding partners—the MyoD- 
family inhibitor proteins, MDFI and MDFIC— 
that fit all the criteria for Piezo-channel aux- 
iliary subunits. Although our study does not 
resolve full-length MDFIC or its native mem- 
brane topology, it does reveal that MDFIC binds 
to the PIEZO1 pore module through its con- 
served C terminus, which regulates channel 
inactivation. The regulation of Piezo-channel 
inactivation critically involves palmitoylation 
of the distal C terminus of MDFIC. Because
palmitoylation is a reversible lipid addition (29), 
this adds the potential for dynamic spatiotem- 
poral regulation of Piezo inactivation by MDFIC. 
Although we identified few other Piezo-channel 
binding-partner candidates with our “fibroblast- 
centric” screen, we speculate that the binding 
region of MDFIC could form a conserved bind- 
ing site for alternate membrane-associated Piezo 
regulators or auxiliary subunits. In support of 
this hypothesis, the PIEZO1 lysine residues 
that form salt bridges with MDFIC have been 
proposed to interact with other proteins (32). 
Despite PIEZO1 channels being able to func- 
tion as independent mechanosensors in sim- 
plified systems (2), ample evidence suggests 
that Piezos, particularly PIEZO2, may receive 
force through molecular tethers (33). Given 
the location of binding, it seems unlikely that 
MDFIC could act as a tethering molecule. 

It remains to be seen how ubiquitous this 
regulatory mechanism is; MDFIC and MDFI 
are absent from many cell types with rapidly in- 
activating PIEZO1 channels, including LNCaP 
(9) and N2A (1), but are expressed in others, 
including fibroblasts (10, 12, 13) and endothe- 
lial cells that exhibit slower PIEZO1 inactiva- 
tion (8, 11, 14). We isolated cardiac fibroblasts 
from mice that harbored a truncated MDFIC 
lacking the C terminus (19), in which PIEZO1
exhibited faster inactivation than in wild-type 
fibroblasts. Thus, MDFIC- or MDFI-mediated
regulation of Piezo channels could be widely 
used. Despite our study not defining a PIEZO1- 
MDFIC complex in lymphatic endothelial cells, 
the similar lymphatic phenotypes associated 
with loss of function of MDFIC (19) and PIEZO1 
(25, 26) mean that this interaction may unearth
molecular aspects underlying lymphatic vas- 
cular disease. Furthermore, both MDFIC and 
MDFI bind to transcription factors (19, 34) [in- 
cluding GATA2, a master regulator of lymphatic 
valve development (35)]. Whether PIEZO1/2 
channels influence this aspect of their func- 
tion is unknown but may lay the foundation 
for the unveiling of a direct mechanosignaling 
pathway by means of Piezos to transcription 
through GATA2 or other transcription factors. 
Thus, our structural and functional data not 
only reveal a family of Piezo-channel auxiliary 
subunits but may also reveal a conserved bind- 
ing site for other membrane-associated pro- 
teins that modulate Piezo-channel function 
and, in doing so, the basis for an alternate Piezo- 
dependent mechanosignaling pathway. 
WORKING LIFE


By Tomasz Głowacki




Teaching workplace skills 


The email got my heart racing. “Thank you for letting me observe your classes. It was an unusual
experience,” it began. I am an industry scientist, and in my spare time I was teaching a course
for computer science students at a local university. By that point in the semester, I hadn’t asked 
the students to solve any technical problems or even open their computers. Instead, I’d focused 
on practical exercises that taught them teamwork and communication, skills they’d need in in- 
dustry. My approach caught the attention of the university, so a teaching expert was dispatched. 
The note made me wonder whether university officials would hail the unique perspective I brought 
from industry—or remove me from the classroom. 


Ever since I was a child, I admired 
my mother, who was a teacher, and 
I wanted to follow in her footsteps. 
During my own studies, I enjoyed 
serving as a teaching assistant, help- 
ing students understand compli- 
cated concepts and witnessing their 
“aha” moments. But upon finishing 
my Ph.D., I decided against pursuing 
an academic position after realizing 
that although my research was well- 
regarded, it was not making any 
practical difference. I felt unfulfilled 
and discontented, and I eventually 
transitioned into industry. 

Over the past 10 years, I have 
worked a variety of jobs. I used my 
analytical skills to uncover insur- 
ance fraud at a consulting company. 
I built and managed teams at a tech- 
nology company. And, in my current 
job, I oversee a team of people that 
uses data science to boost company 
efficiency and enable better decision-making. In all these 
roles in business, I have tapped into my scientific expertise. 
But I have also had the opportunity to develop skills I didn’t 
learn in academia. 

At the consulting company, for instance, I worked with 
software developers, insurance experts, and lawyers. This 
helped me learn how to communicate what I was doing in 
a nontechnical way so everyone could understand. Later, at 
the technology company, I learned about the importance of 
team dynamics and collaboration after recruiting a highly 
skilled individual, complete with a Ph.D., who seemed the 
right fit for the job. It quickly became apparent, however, 
that he had difficulties dealing with others. Some of his col- 
leagues called him a “smart jerk” behind his back. From 
then on, I didn’t just look at someone’s technical expertise 
when hiring; I also wanted to know what soft skills they 
could bring to the table. 




I thought of these lessons when 
I started to develop the university 
class. The part-time role was ap- 
pealing because I was eager to share 
my expertise and get back to work- 
ing with students. But I didn’t want 
to use my old teaching style, which 
focused on lectures and whiteboard 
exercises. I wanted to cultivate the 
same qualities I was looking for in 
job candidates at my company. 

I decided to use games and chal- 
lenges to encourage teamwork and 
creativity. In one, students had to 
work together to decide how to sur- 
vive a harsh desert environment. In 
another, they built the tallest struc- 
ture they could out of spaghetti and 
marshmallows. The teaching expert 
sat in on one such class. 

I worried they might have been 
taken aback. But after that un- 
settling opening, their email was 
positive: “I understand the important message you’re conveying,”
they wrote. “Today’s industry is about people, not
just computers. I fully support you. Keep up the good work.” 

I’ve now been teaching this course for 4 years, enough 
time for some of my students to have graduated and taken 
jobs in industry. At a recent networking event, some of 
them told me how the skills and knowledge they gained 
in my class helped them land their jobs. Hearing that my 
teaching had a positive impact on their career paths was 
truly rewarding. 

In the end, education is not just about imparting knowl- 
edge. It’s also about helping students develop soft skills that 
will help them succeed wherever they plan to go, be it aca- 
demia or industry. 


Tomasz Głowacki is the head of data science at Żabka, a company in
Poland, and an assistant professor at WSB Merito University Poznań.